TASTE RECEPTORS OF THE T1R FAMILY FROM DOMESTIC CAT

FIELD OF THE INVENTION 

[0001] The present invention relates to the field of sensory mechanisms of the domestic cat, 
Felis catus. The invention relates, for example, to the discovery of several genes of Felis catus 
encoding taste receptors of the T1R family, T1R1 (or Tas1R1), T1R2 (or Tas1R2), and T1R3 (or
Tas1R3). The invention further relates to the polypeptides encoded by the feline T1R1, T1R2,
and T1R3 genes and to methods and uses of the same. 

BACKGROUND OF THE INVENTION 

[0002] The sense of taste is important for determining food choice, for regulating food intake, 
and for ensuring efficient use of ingested nutrients. Taste can act as a warning system for the 
presence of potentially harmful foods, by, for example, the aversive sensations of sourness or
bitterness, and as an attractant to potentially nutrient-rich foods, by, for example, the appealing 
sensations of sweetness, saltiness, and umami. 

[0003] Taste stimuli are received by taste receptor cells assembled into taste buds that are 
located in the epithelium of taste papillae of the tongue (Kitagawa et al., Bioch. Bioph. Res.
Comm., 283:236-242 (2001)). The stimuli are believed to be transduced by taste receptors at the 
surface of the taste receptor cells (Id.). The taste receptors encoded by the genes of a given 
species are reflective of that species' food choices. For example, the "sweet receptors" of an 
herbivorous species are expected to be different from those of a carnivorous species, since the 
two consume completely different diets whose foods contain different primary stimuli. Since 
taste receptor specificity likely reflects food choice, it follows that receptor sequence homology 
among species may be as predictive or more predictive of food preferences of a given species as 
phylogenetic relatedness among species. 

[0004] Evolution has provided that each species' genes code for taste receptors unique to that 
species' food choices. For example, the "sweet receptors" of an herbivore are expected to be 
different from those of a carnivore, since the two consume completely different diets whose 
foods contain different primary stimuli. Since taste receptor specificity must reflect food choice, 
it may follow that receptor sequence homology among species might be dependent more upon 
the types of foods consumed by individual species rather than by the phylogenetic relatedness of 
species. The behavior of carnivores, such as the domestic cat, towards stimuli such as sweet 
carbohydrates, which it cannot taste (Beauchamp et al., J. Comp. Physiol. Psychol., 91(5):1118-
1127 (1977)), and towards L-amino acids, which it can taste, should be explainable based on the 
specificity of the taste receptors of carnivores in general. The behavior of the domestic cat (Felis 
catus), a carnivore, towards stimuli such as sweet carbohydrates, which it generally cannot taste, 
and towards L-amino acids, which it generally can taste, should be explicable by the specificity 
of taste receptors of other carnivores. Direct knowledge of taste receptor genes will allow insight 
into an animal's sensory world and may be useful for identifying modulators of the taste 
receptors encoded thereby to influence an animal's taste preferences. 

[0005] Molecular receptors for the taste element of sweetness have recently been identified
from human, mouse, and rat. Thus far, there are three known members of the T1R taste receptor
family: T1R1, T1R2, and T1R3 (Montmayeur & Matsunami, Curr. Opin. Neurobiol., 12(4):366-
371 (2002)). The T1R3 receptor gene is located within the Sac locus, the primary genetic locus
controlling preference for sweet-tasting stimuli in mice (Li et al., Mamm. Genome, 12(1):13-16
(2001); Li et al., Mamm. Genome, 13(1):5-19 (2002)). The human syntenic region for mouse
T1R3 gene is on 1p36.33 (1162-1186 kb). The gene for T1R1 is located on human 1p36.23
(6324-6349 kb), which is ~5 Mb from T1R3, and that for T1R2 is located on human 1p36.13
(18483-18729 kb), which is ~12 Mb from T1R1.
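
The approximate spacings quoted above can be checked with simple arithmetic. The following is a minimal sketch, assuming the kilobase coordinates given in this paragraph and measuring from the end of one locus to the start of the next; the names LOCI_KB and gap_mb are illustrative only and are not part of the disclosure.

```python
# Approximate positions (in kb) on the short arm of human chromosome 1,
# taken from the paragraph above. The dictionary name is hypothetical.
LOCI_KB = {
    "T1R3": (1162, 1186),
    "T1R1": (6324, 6349),
    "T1R2": (18483, 18729),
}


def gap_mb(upstream: str, downstream: str) -> float:
    """Distance in Mb from the end of one locus to the start of the next."""
    _, upstream_end = LOCI_KB[upstream]
    downstream_start, _ = LOCI_KB[downstream]
    return (downstream_start - upstream_end) / 1000.0


print(f"T1R3 to T1R1: {gap_mb('T1R3', 'T1R1'):.1f} Mb")
print(f"T1R1 to T1R2: {gap_mb('T1R1', 'T1R2'):.1f} Mb")
```

Under these assumptions the two gaps come out at roughly 5 Mb and 12 Mb, consistent with the figures stated in the text.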

[0006] Most of the T1Rs are G-protein coupled receptors with long N-terminal extracellular
domains believed to be involved in ligand binding (Montmayeur & Matsunami, Curr. Opin.
Neurobiol., 12(4):366-371 (2002)). Within the cell, the taste receptors heterodimerize, with
T1R3 coupling separately with T1R1 and T1R2. In mouse, the T1R1/T1R3 heterodimer 
functions as a receptor for selected amino acids. The T1R2/T1R3 heterodimer functions as a 
receptor for stimuli considered sweet by humans. Current data indicate that the T1R3 
component of the T1R heterodimer couples the taste receptor to cellular signal transduction 
processes, thereby ensuring that the stimulus-binding event is transduced to a neural signal. 
Thus, knowledge of the T1R receptors will lead to better understanding of species-specific
reactions to sapid stimuli. 

[0007] Currently, mechanisms for identifying novel taste stimuli for the domestic cat are 
limited, for example, to exhaustive and difficult feeding studies in which a novel ingredient is 
paired with a control ingredient and intake of the two is compared. Considerable time, effort,
and expense can be expended in the discovery of a single stimulus. Furthermore, feline illnesses 
often are exacerbated by a cat's refusal to eat. Additionally, the molecular features that define 
acceptable taste stimuli for domestic cat remain largely unknown, making rational computational 
design approaches for taste stimuli difficult. As a result, knowledge of the feline taste receptor 
and its ligands may lead to a better understanding of cat taste perception and modulation thereof. 

[0008] The present invention provides novel feline taste receptors, T1R1, T1R2, and T1R3, and
methods of use thereof to identify compounds that can stimulate, inhibit, or modify the ingestive
responses or general behavior of a cat. The screening methods of the invention allow the rapid 
screening of binding partners, agonists, antagonists, and modulators of the T1R receptors of the 
domestic cat. The results of the feline T1R receptor studies reflect the unique taste profile of the 
domestic cat. 

SUMMARY OF THE INVENTION 

[0009] Certain embodiments of the present invention relate to polynucleotides encoding a T1R 
receptor, including, but not limited to polynucleotides having the nucleotide sequence of SEQ ID 
NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, SEQ ID NO:63, fragments of the
polynucleotide of SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID
NO:63 encoding a polypeptide having substantially the same biological activity as a polypeptide
encoded by the nucleotide sequence of SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID
NO:62, or SEQ ID NO:63, respectively; variants of the polynucleotide of SEQ ID NO:1 having
at least 80% homology to the polynucleotide of SEQ ID NO:1; variants of the polynucleotide of
SEQ ID NO:59 or SEQ ID NO:60 having at least 85% homology to the polynucleotide of SEQ
ID NO:59 or SEQ ID NO:60; variants of the polynucleotide of SEQ ID NO:62 or SEQ ID NO:63
having at least 75% homology to SEQ ID NO:62 or SEQ ID NO:63; polynucleotide variants of
SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63 encoding a
polypeptide having substantially the same biological activity as a polypeptide encoded by the
nucleotide sequence of SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ
ID NO:63, respectively; variants of the polynucleotide of SEQ ID NO:1, SEQ ID NO:59, SEQ
ID NO:60, SEQ ID NO:62, or SEQ ID NO:63 encoding a polypeptide conferring modified taste
perception to one or more taste stimuli relative to a polypeptide encoded by the polynucleotide of
SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63,
respectively; nucleotide sequences encoding the amino acid sequence of SEQ ID NO:2, SEQ ID
NO:61, or SEQ ID NO:64; nucleotide sequences substantially complementary to the nucleotide
sequence of SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63;
and nucleotide sequences that hybridize to the complement of the polynucleotide having SEQ ID
NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63 under high stringency
conditions. The biological activity of the polypeptides encoded by the polynucleotides of the
invention may be determined, for example, by an in vitro binding assay, such as but not limited
to assessing the level of binding of the polypeptide to its respective T1R heterodimerization
partner. The polynucleotides of the invention may be DNA or RNA and may be single- or
double-stranded. In some embodiments of the invention, the polynucleotide fragments have at
least about 42 nucleotides. The polynucleotide fragments of the invention encode, for example,
an extracellular domain of the polypeptide of SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64;
a transmembrane domain of the polypeptide of SEQ ID NO:2, SEQ ID NO:61, or SEQ ID
NO:64; or an intracellular domain of the polypeptide of SEQ ID NO:2, SEQ ID NO:61, or SEQ
ID NO:64. In other embodiments of the invention, the polynucleotide variants of the
polynucleotide of SEQ ID NO:1 encoding an amino acid sequence of SEQ ID NO:2 have a
nonconserved amino acid substitution, for example, at residue 59 and/or residue 64.
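
By way of illustration only, the percentage thresholds recited above (80%, 85%, and 75%) could be applied to a pair of pre-aligned sequences as sketched below. This sketch assumes that "homology" is scored as percent identity over aligned columns, which is one common convention and is not defined in this paragraph; the function names and the toy sequences are hypothetical.

```python
# Minimum percent homology recited above for variants of each reference sequence.
MIN_HOMOLOGY = {
    "SEQ ID NO:1": 80.0,
    "SEQ ID NO:59": 85.0,
    "SEQ ID NO:60": 85.0,
    "SEQ ID NO:62": 75.0,
    "SEQ ID NO:63": 75.0,
}


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns with identical residues ('-' marks a gap).

    Assumption: both strings come from the same alignment and have equal length.
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def meets_threshold(reference_id: str, aligned_ref: str, aligned_variant: str) -> bool:
    """True if the variant reaches the minimum recited for that reference."""
    return percent_identity(aligned_ref, aligned_variant) >= MIN_HOMOLOGY[reference_id]


# Toy 10-nucleotide example (not real T1R sequence): 8 of 10 columns match.
print(percent_identity("ATGCATGCAT", "ATGCATGGTT"))
print(meets_threshold("SEQ ID NO:1", "ATGCATGCAT", "ATGCATGGTT"))
print(meets_threshold("SEQ ID NO:59", "ATGCATGCAT", "ATGCATGGTT"))
```

In practice the alignment itself would come from a dedicated alignment tool; the sketch covers only the scoring step.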

[0010] The invention also encompasses expression vectors containing the polynucleotides of
the invention operably linked to a promoter. Another embodiment of the invention provides host 
cells containing the expression vector. The host cells may be mammalian, including human, 
murine, porcine, bovine, canine, or feline. The invention further encompasses cell cultures of the 
host cells. The invention also encompasses methods of producing a feline T1R receptor by 
culturing the host cells and recovering receptor therefrom. 

[0011] Another embodiment of the invention includes T1R receptor polypeptides encoded by 
the polynucleotides of the invention. The polypeptides of the invention include, for example, 
those having the amino acid sequence of SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64, 
fragments of at least 30 contiguous amino acids of SEQ ID NO:2, SEQ ID NO:61, or SEQ ID 
NO:64, and variants thereof having substantially the same biological activity as the polypeptide
of SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64, respectively. The variant polypeptides of 
the invention may have an amino acid sequence having at least one sequence variation of SEQ 
ID NO:2, SEQ ID NO:61, or SEQ ID NO:64 that confers modified taste perception to one or 
more taste stimuli relative to a polypeptide of SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64, 
respectively. 

[0012] The invention provides methods of identifying a feline T1R receptor variant that 
confers modified taste perception by expressing a variant of the polynucleotide of SEQ ID NO:1,
SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63 homologous to the
polynucleotide of SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID
NO:63, respectively, and detecting an increase or a decrease in the biological activity of the
polypeptide encoded by the variant relative to the biological activity of the polypeptide encoded
by SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63,
respectively. 

[0013] The invention further provides kits for the detection of polynucleotides encoding a 
feline T1R receptor including a polynucleotide that specifically hybridizes to a polynucleotide 
encoding a polypeptide having an amino acid sequence of SEQ ID NO:2, SEQ ID NO:61, or 
SEQ ID NO:64, and instructions relating to detection thereof.

[0014] Also provided by the invention are antibodies that immunoreact specifically with at 
least one epitope of a polypeptide of the invention. The invention also includes kits for the 
detection of polypeptides encoding a feline T1R receptor including antibodies of the invention 
and instructions relating to detection. 

[0015] Further provided by the invention are methods for identifying a compound that interacts
with a feline T1R receptor by expressing a polynucleotide of the invention in the presence of a 
test compound, and detecting direct or indirect interaction between a polypeptide produced by 
the expression step with the compound. Also provided are methods for identifying compounds 
that interact with a feline T1R receptor by contacting a feline T1R receptor with a test 
compound, and detecting interaction between the receptor and the compound. The methods for 
detecting such interaction may be cell-based or cell-free assays. For example, a polynucleotide 
of the invention may be expressed in a heterologous expression system or in a cellular extract. 
The receptor may be bound to a solid support. In one aspect of the invention, the recognition 
sites of the receptor are coupled with a monitoring system, either electrical or optical. In another 
embodiment, the solid support is formulated into a feline-specific electronic tongue or biosensor. 

[0016] The invention also provides methods for identifying agonists and antagonists of a feline 
T1R receptor. For example, the methods of the invention include identification of an agonist of a 
feline T1R receptor by expressing a polynucleotide of the invention in the presence of a test 
compound, and detecting increased transcription of said polynucleotide or increased biological 
activity of a polypeptide produced by the expression step in the presence of the compound 
relative to the rate of transcription or biological activity of the polypeptide in the absence of the 
compound. The biological activity detected may be an increase or decrease in the interaction 
between the T1R receptor and its T1R heterodimerization partner. For example, the T1R
heterodimerization partner of a T1R1 or a T1R2 receptor may be T1R3 and vice versa. Also 
included are methods for identifying agonists of a feline T1R receptor by contacting a 
polypeptide of the invention with a test compound, and detecting an increase in biological 
activity of the polypeptide in the presence of the compound relative to biological activity of the 
polypeptide in the absence of the compound. The methods for identifying agonists of the cat T1R 
receptors may be cell-based or cell-free assays. For example, a polynucleotide of the invention 
may be expressed in a heterologous expression system or in a cellular extract. The receptor may 
be bound to a solid support. In one aspect of the invention, the recognition sites of the receptor 
are coupled with a monitoring system, either electrical or optical. In another embodiment, the 
solid support is formulated into a feline-specific electronic tongue or biosensor. 

[0017] Methods for identifying antagonists of the polypeptides of the invention also are 
provided. For example, the invention provides methods for identifying antagonists of a feline 
T1R receptor by expressing a polynucleotide of the invention in the presence of a test compound, 
and detecting decreased transcription of said polynucleotide or decreased biological activity of a 
polypeptide produced by the expression step in the presence of the compound relative to the rate
of transcription or biological activity of the polypeptide in the absence of the compound. 
Another example of methods for identifying an antagonist of a feline T1R receptor involves 
contacting a polypeptide of the invention with a test compound, and detecting a decrease in 
biological activity of the polypeptide in the presence of the compound relative to biological 
activity of the polypeptide in the absence of the compound. The methods for identifying the 
antagonists may be cell-based or cell-free assays. For example, a polynucleotide of the invention 
may be expressed in a heterologous expression system or in a cellular extract. The receptor may 
be bound to a solid support. In one aspect of the invention, the recognition sites of the receptor 
are coupled with a monitoring system, either electrical or optical. In another embodiment, the 
solid support is formulated into a feline-specific electronic tongue or biosensor. 
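
The comparisons described in the two preceding paragraphs (activity in the presence of a test compound relative to activity in its absence) can be sketched as follows. This is an illustrative sketch only: the 1.2-fold cutoff, the function name, and the example readings are assumptions and are not taken from the specification, which does not fix any numerical threshold.

```python
def classify_compound(activity_with: float,
                      activity_without: float,
                      min_fold_change: float = 1.2) -> str:
    """Label a test compound from two activity readings.

    activity_with    -- measured activity in the presence of the compound
    activity_without -- measured activity in the absence of the compound
    min_fold_change  -- hypothetical cutoff for calling a change meaningful
    """
    if activity_without <= 0 or activity_with < 0:
        raise ValueError("activities must be non-negative, baseline positive")
    if min_fold_change <= 1:
        raise ValueError("min_fold_change must exceed 1")
    ratio = activity_with / activity_without
    if ratio >= min_fold_change:
        return "candidate agonist"       # increased activity
    if ratio <= 1 / min_fold_change:
        return "candidate antagonist"    # decreased activity
    return "no clear effect"


# Made-up readings in arbitrary units.
print(classify_compound(150.0, 100.0))
print(classify_compound(50.0, 100.0))
print(classify_compound(105.0, 100.0))
```

A real screen would also need replicates and a statistical test; the sketch only captures the direction of the comparison.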

[0018] Another embodiment of the invention includes compounds and compositions for 
modifying the taste perception of a mammal, such as a cat. The compounds and compositions 
may contain at least one of the polynucleotides of the invention, polypeptides of the invention, or 
compounds identified by the methods of the invention. Examples of the compositions of the 
invention include veterinary foods and drinks and pharmaceutical compositions. The 
compositions of the invention may include a pharmaceutically acceptable excipient. The 
compositions of the invention may be breed-specific. Methods for modifying the taste 
perception of a mammal (e.g., a cat) by administering to the mammal a polynucleotide of the 
invention, a polypeptide of the invention, and/or a compound identified according to the methods 
of the invention also are provided. 

[0019] The invention further provides transgenic animals comprising a polynucleotide of the 
invention. 

[0020] The materials, methods, and examples provided herein are illustrative only and are not 
intended to be limiting. Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0021] Figures 1A-1I show the multiple sequence alignment of the T1R receptors of domestic 
cat (T1R1, SEQ ID NO:60; T1R2, SEQ ID NO:63; and T1R3, SEQ ID NO:1) with known
nucleotide sequences of receptors of the T1R family from human (T1R1, SEQ ID NO:8; T1R2,
SEQ ID NO:5; T1R3, SEQ ID NO:11), mouse (T1R1, SEQ ID NO:6; T1R2, SEQ ID NO:3;
T1R3, SEQ ID NO:9), and rat (T1R1, SEQ ID NO:7; T1R2, SEQ ID NO:4; T1R3, SEQ ID
NO:10). An asterisk (*) indicates a conserved nucleotide position among the sequences. A heart
(♥) indicates the stop codon of feline T1R2.

[0022] Figures 2A-D show the deduced amino acid sequences of the feline T1R taste receptors 
(T1R1, SEQ ID NO:61; T1R2, SEQ ID NO:64; and T1R3, SEQ ID NO:2) aligned with the 
amino acid sequences of members of the T1R receptor family from human (T1R1, SEQ ID 
NO:17; T1R2, SEQ ID NO:20; T1R3, SEQ ID NO:12), rat (T1R1, SEQ ID NO:16; T1R2, SEQ 
ID NO:19; T1R3, SEQ ID NO:14), and mouse (T1R1, SEQ ID NO:15; T1R2, SEQ ID NO:18; 
T1R3, SEQ ID NO:13). An asterisk (*) indicates a conserved nucleotide position among the 
sequences. A colon (:) indicates an observed conserved amino acid substitution. A period (.) 
indicates an observed semi-conserved amino acid substitution. The deduced amino acid 
sequence for cat T1R3 (SEQ ID NO:2) contains four additional amino acids at positions 11-14 
relative to the homologous T1R3 receptors of mouse (SEQ ID NO:13), human (SEQ ID NO:12), 
and rat (SEQ ID NO:14). The deduced sequence for cat reveals a threonine in position 64, a
position equivalent to amino acid 60 in mouse, and a leucine at position 59, a position equivalent 
to position 55 in mouse. In mouse, amino acid substitutions of a threonine at position 60 and an 
alanine at position 55, both positions located within the putative extracellular N-terminal domain 
of the polypeptide, are present in strains of mice demonstrating low preference for the sweet 
stimulus saccharin (Bachmanov et al., Chem. Senses, 26:925-933 (2001)). Leucine is a
conservative substitution for alanine. Accordingly, the amino acid sequence differences of cat 
and mouse T1R3 receptor may account for functional differences that lead to different taste 
preferences between the two species. 
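
The position equivalences stated above (cat 64 with mouse 60, cat 59 with mouse 55) follow from the four additional residues at cat positions 11-14. A minimal sketch of that offset is given below, assuming no other insertions or deletions between the two sequences in the N-terminal region discussed here; the function name is hypothetical and the mapping should not be relied on beyond the positions described in this paragraph.

```python
from typing import Optional

# Cat T1R3 carries four additional residues at positions 11-14 (1-based)
# relative to mouse T1R3, as stated in the figure description above.
INSERT_START = 11
INSERT_END = 14


def cat_to_mouse_position(cat_pos: int) -> Optional[int]:
    """Map a 1-based cat T1R3 residue number to its mouse equivalent.

    Returns None for the inserted residues, which have no mouse counterpart.
    """
    if cat_pos < 1:
        raise ValueError("positions are 1-based")
    if cat_pos < INSERT_START:
        return cat_pos
    if cat_pos <= INSERT_END:
        return None
    return cat_pos - (INSERT_END - INSERT_START + 1)


for pos in (59, 64, 12):
    print(pos, "->", cat_to_mouse_position(pos))
```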

[0023] Figure 3 illustrates a phylogenetic tree showing the relatedness of the domestic cat T1R 
receptor family to the T1R family of receptors including human, rat, and mouse T1R1, T1R2, 
and T1R3. The T1R receptors of the rat and mouse are closely related, while the T1R receptors 
of human and cat diverge from rat and mouse. Interestingly, the sweet stimuli to which the rat 
and mouse respond are very similar, whereas those that stimulate the human and those that 
stimulate the cat differ from one another and from those for rat and mouse. For example, humans 
are unique in their ability to taste most high-intensity sweeteners, while cats find many amino 
acids attractive but are unable to taste most carbohydrate and high-intensity sweeteners. The cat 
T1R2 diverges from that of human, mouse, and rat, which is consistent with the fact that cat does 
not show preference for the carbohydrate sweeteners. 

[0024] Figure 4 illustrates the predicted conformation of cat T1R3 receptor. The cat T1R3
receptor is a seven transmembrane receptor similar in structure to other known members of the 
T1R family of receptors. The structure of the feline T1R3 receptor was generated through use of 
a protein modeling program available at <www.ebi.ac.uk/~moeller/transmembrane.html>.

[0025] Figure 5A shows the predicted conformation of cat T1R1, indicating that the receptor is
a 7-transmembrane-type receptor with general similarity to other known members of the T1R
family. The predicted conformation of cat T1R1 is the same as that for cat T1R3. Figure 5B
illustrates the predicted conformation of cat T1R2. Since feline T1R2 is a short protein (391 
amino acids), a 7 transmembrane domain protein is not predicted. Without seven transmembrane 
domains, the cat T1R2 receptor may not interact appropriately with T1R3 and the plasma 
membrane. This inability to form the T1R2/T1R3 heterodimer results in the cat's inability to 
taste sweet carbohydrates. The cat T1R2 may have another function. 

[0026] Figures 6A-D show the genomic sequence of cat T1R1 obtained from BAC 
sequencing. The letter "N" denotes gaps between exons or unknown sequences. 

[0027] Figures 7A-E show the genomic sequence of cat T1R2 obtained from BAC 
sequencing. The letter "N" denotes gaps between exons or unknown sequences. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

[0028] The reference works, patents, patent applications, and scientific literature that are 
referred to herein reflect in part the knowledge of those with skill in the art and are hereby 
incorporated by reference in their entirety to the same extent as if each was specifically and 
individually indicated to be incorporated by reference. Any conflict between any reference cited 
herein and the specific teachings of this specification shall be resolved in favor of the latter. 
Likewise, any conflict between an art-understood definition of a word or phrase and a definition 
of the word or phrase as specifically taught in this specification shall be resolved in favor of the 
latter. 

[0029] Standard reference works setting forth the general principles of recombinant DNA 
technology are known to those of skill in the art (Ausubel et al., Current Protocols in
Molecular Biology, John Wiley & Sons, New York, 1998; Sambrook et al., Molecular
Cloning: A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Plainview,
New York, 1989; Kaufman et al., Eds., Handbook of Molecular and Cellular Methods in
Biology and Medicine, CRC Press, Boca Raton, 1995; McPherson, Ed., Directed 
Mutagenesis: A Practical Approach, IRL Press, Oxford, 1991). 

[0030] As used herein, "T1R receptor" encompasses the taste receptors of the T1R1, T1R2,
and T1R3 types. 

[0031] As used herein, "taste perception" refers to a response (e.g., biochemical, behavioral) or 
sensitivity of a T1R receptor of the invention to a taste stimulus. "Taste stimulus" as used herein 
refers to any compound that elicits, for example at the biochemical level (e.g., activation or 
inhibition of a taste receptor) or behavioral level (e.g., preference, indifference, or distaste), a
taste response which would be perceived by a mammal as at least one of the five taste elements, 
including sweet, salty, sour, bitter, and umami. "Taste perception" or "taste stimulus," or 
variants thereof, does not require, though it does include, transmission of a neural signal 
resulting in in vivo sensation of taste by a mammal. Modification of taste perception includes an 
alteration of (enhancement of, reduction to, or change to) a biochemical response, an ingestive 
response, a taste preference, or general behavior of a mammal in response to a compound. 

[0032] As used herein "polynucleotide" refers to a nucleic acid molecule and includes genomic
DNA, cDNA, RNA, mRNA, mixed polymers, recombinant nucleic acids, fragments and variants
thereof, and the like. Polynucleotide fragments of the invention comprise at least 10, and
preferably at least 12, 14, 16, 18, 20, 25, 30, 35, 40, 45, 50, 75, or 100 consecutive nucleotides of
a reference polynucleotide. The polynucleotides of the invention include sense and antisense
strands. The polynucleotides of the invention may be naturally occurring or non-naturally
occurring polynucleotides. A "synthesized polynucleotide" as used herein refers to
polynucleotides produced by purely chemical, as opposed to enzymatic, methods. "Wholly"
synthesized DNA sequences are therefore produced entirely by chemical means, and "partially"
synthesized DNAs embrace those wherein only portions of the resulting DNA were produced by
chemical means. The polynucleotides of the invention may be single- or double-stranded. The
polynucleotides of the invention may be chemically modified and may contain non-natural or
derivatized nucleotide bases as will be readily appreciated by those skilled in the art. Such
modifications include, for example, labels, methylation, substitution of one or more nucleotides
with an analog, internucleotide modifications such as uncharged linkages (e.g., methyl
phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g.,
phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., polypeptides, etc.),
intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages (e.g.,
alpha anomeric nucleic acids, etc.). Also included are synthetic molecules that mimic
polynucleotides in their ability to bind to a designated sequence via hydrogen bonding and other 
chemical interactions. Such molecules are known in the art and include, for example, those in 
which peptide linkages substitute for phosphate linkages in the backbone of the molecule. 

[0033] "Recombinant nucleic acid" is a nucleic acid generated by combination of two 
segments of nucleotide sequence. The combination may be, for example, by chemical means or 
by genetic engineering. 

[0034] As used herein, "polynucleotide amplification" refers to a broad range of techniques for 
increasing the number of copies of specific polynucleotide sequences. Typically, amplification 
of either or both strand(s) of the target nucleic acid comprises the use of one or more nucleic 
acid-modifying enzymes, such as a DNA polymerase, ligase, RNA polymerase, or RNA- 
dependent reverse transcriptase. Examples of polynucleotide amplification include, but are not 
limited to, polymerase chain reaction (PCR), nucleic acid sequence based amplification (NASBA),
self-sustained sequence replication (3SR), strand displacement amplification (SDA), ligase chain
reaction, Qβ replicase system, and the like. A wide variety of alternative cloning and in vitro
amplification methodologies are well known to those skilled in the art. Examples of these 
techniques are found in, for example, Berger et al., Guide to Molecular Cloning Techniques, 
Methods in Enzymology 152, Academic Press, Inc., San Diego, CA (Berger), which is 
incorporated herein by reference in its entirety. 

[0035] As used herein, the term "oligonucleotide" or "primer" refers to a series of linked 
nucleotide residues which has a sufficient number of bases to be used in a polymerase chain 
reaction (PCR). This short sequence is based on (or designed from) a genomic or cDNA 
sequence and is used to amplify, confirm, or reveal the presence of an identical, similar, or 
complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions 
of a nucleic acid sequence having at least about 10 nucleotides and as many as about 50 
nucleotides, often about 12 or 15 to about 30 nucleotides. They are chemically synthesized and 
may be used as probes. "Primer pair" refers to a set of primers including a 5' upstream primer 
that hybridizes with the 5' end of a target sequence to be amplified and a 3' downstream primer 
that hybridizes with the complement of the 3' end of the target sequence to be amplified. 
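
A primer pair of the kind defined above can be sketched in a few lines. The sketch below assumes the usual convention that the upstream primer matches the first bases of the sense strand and the downstream primer is the reverse complement of its last bases, so that both primers are written 5' to 3'. The default length of 20 is an arbitrary choice within the 10 to 50 nucleotide range given above, and the target sequence and function names are hypothetical.

```python
# Translation table for Watson-Crick complements of unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def primer_pair(target: str, length: int = 20) -> tuple:
    """Return (upstream, downstream) primers for a sense-strand target.

    Assumption: the target is given 5' to 3' and the primers do not overlap.
    """
    if not 10 <= length <= 50:
        raise ValueError("primer length should be about 10 to 50 nucleotides")
    if len(target) < 2 * length:
        raise ValueError("target too short for two non-overlapping primers")
    upstream = target[:length]
    downstream = reverse_complement(target[-length:])
    return upstream, downstream


# Made-up 36-nucleotide target, not a T1R sequence.
toy_target = "ATGGCCAAGTTTCCCGGGAAATTTGGGCCCTAAGCT"
fwd, rev = primer_pair(toy_target, length=12)
print("upstream  :", fwd)
print("downstream:", rev)
```

Real primer design would additionally weigh melting temperature, GC content, and secondary structure, none of which are modeled here.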

[0036] As used herein, the term "probe" refers to nucleic acid sequences of variable length, for 
example between at least about 10 and as many as about 6,000 nucleotides, depending on use. 
Probes are used in the detection of identical, similar, or complementary target nucleic acid 
sequences, which target sequences may be single- or double-stranded. Longer probes are usually
obtained from a natural or recombinant source, are highly specific, and are much slower to 
hybridize than oligomers, or shorter probes. They may be single- or double-stranded and are 
carefully designed to have specificity in PCR, hybridization membrane-based, or ELISA-like 
technologies. An "overgo probe" is a DNA probe comprising two short, overlapping DNA 
sequences (e.g., 10-50 nucleotides each) with a complementary overlapping region (e.g., 5-15 
nucleotides) that is used in an overgo hybridization strategy. For example, an overgo probe may 
be two 22mers with an 8 bp complementary overlap, resulting in a 36mer overgo probe. As 
another example, an overgo probe may be two 24mers with an 8 bp complementary overlap, 
resulting in a 40mer overgo probe. 
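
The lengths in the two examples above follow from the relation: total length = 2 x oligomer length - overlap. The sketch below checks that arithmetic and derives the two overlapping oligomers from a target region, assuming the second oligomer is taken from the opposite strand so that the 3' ends of the two oligomers are complementary over the overlap. The function names and the 36-nucleotide region are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def overgo_length(oligo_len: int, overlap: int) -> int:
    """Length of the region spanned by two overlapping oligomers."""
    return 2 * oligo_len - overlap


def design_overgo(region: str, oligo_len: int = 22, overlap: int = 8) -> tuple:
    """Split a target region into two oligomers that pair over `overlap` bases."""
    total = overgo_length(oligo_len, overlap)
    if len(region) != total:
        raise ValueError(f"region must be exactly {total} nucleotides long")
    top = region[:oligo_len]                                   # sense strand, 5' part
    bottom = reverse_complement(region[total - oligo_len:])    # opposite strand
    return top, bottom


print(overgo_length(22, 8))   # two 22mers, 8 bp overlap
print(overgo_length(24, 8))   # two 24mers, 8 bp overlap

# Made-up 36-nucleotide region, not a T1R sequence.
top, bottom = design_overgo("ATGGCCAAGTTTCCCGGGAAATTTGGGCCCTAAGCT")
print(top)
print(bottom)
```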

[0037] As used herein, the phrase "stringent hybridization conditions" or "stringent conditions" 
refers to conditions under which a probe, primer, or oligonucleotide will hybridize to its target 
sequence, but to a minimal number of other sequences. Stringent conditions are sequence- 
dependent and will be different in different circumstances. Longer sequences will hybridize with 
specificity to their proper complements at higher temperatures. Generally, stringent conditions 
are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence
at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH
and nucleic acid concentration) at which 50% of the probes complementary to the target
sequence hybridize to the target sequence at equilibrium. Since the target sequences are
generally present in excess, at Tm, 50% of the probes are hybridized to their complements at
equilibrium. Stringent temperature conditions will generally include temperatures in excess of 
30°C, typically in excess of 37°C, and may be in excess of 45°C. Stringent salt conditions will 
ordinarily be less than 1.0 M, typically less than 0.5 M, and may be less than 0.2 M. Typically, 
stringent conditions will be those in which the salt concentration is less than about 1.0M sodium 
ion, typically about 0.01 to 1.0 M sodium ion (or other salts) at pH 7.0 to 8.3 and the temperature 
is at least about 30°C for short probes, primers, or oligonucleotides (e.g., 10 to 50 nucleotides) 
and at least about 60°C for longer probes, primers, or oligonucleotides. Stringent conditions may 
also be achieved with the addition of destabilizing agents, such as formamide. 
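
As a rough illustration, the typical salt and temperature ranges recited above can be expressed as a simple check. This is a minimal sketch: the function names are hypothetical, the 50-nucleotide cutoff between "short" and "longer" probes is taken from the example range given in the text, and the check does not model destabilizing agents such as formamide.

```python
def hybridization_temp(tm_c: float) -> float:
    """Stringent temperature chosen about 5 C below the melting point Tm."""
    return tm_c - 5.0


def meets_stringent_ranges(sodium_molar: float, temp_c: float, probe_len: int) -> bool:
    """Compare conditions with the typical ranges recited in the text."""
    salt_ok = sodium_molar < 1.0  # less than about 1.0 M sodium ion
    # Short probes (e.g., 10 to 50 nucleotides): at least about 30 C.
    # Longer probes: at least about 60 C.
    min_temp = 30.0 if probe_len <= 50 else 60.0
    return salt_ok and temp_c >= min_temp


print(hybridization_temp(70.0))                # 65.0
print(meets_stringent_ranges(0.5, 42.0, 22))   # True
print(meets_stringent_ranges(0.5, 42.0, 500))  # False
```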

[0038] As used herein "antisense oligonucleotide" refers to a nucleic acid molecule that is 
complementary to at least a portion of a target nucleotide sequence of interest and specifically 
hybridizes to the target nucleotide sequence under physiological conditions. The term "double 
stranded RNA" or "dsRNA" as used herein refers to a double-stranded RNA molecule capable of 
RNA interference, including short interfering RNA (siRNA) (see, for example, Bass, Nature, 
411, 428-429 (2001); Elbashir et al., Nature, 411, 494-498 (2001)). 

[0039] As used herein, the term "complementary" refers to Watson-Crick basepairing between 
nucleotide units of a nucleic acid molecule. 
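
Watson-Crick complementarity between two DNA strands can be illustrated with a short sketch. The helper names below are illustrative assumptions; sequences are written 5' to 3' and only the four unambiguous DNA bases are handled.

```python
# Watson-Crick pairs for DNA: A pairs with T, and C pairs with G.
_PAIR = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.translate(_PAIR)[::-1]


def is_complementary(strand_a: str, strand_b: str) -> bool:
    """True if the two strands base-pair along their full length."""
    return reverse_complement(strand_a).upper() == strand_b.upper()


print(reverse_complement("ATGC"))        # GCAT
print(is_complementary("ATGC", "GCAT"))  # True
```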

[0040] The term "marker gene" or "reporter gene" refers to a gene encoding a product that, 
when expressed, confers a phenotype at the physical, morphologic, or biochemical level on a 
transformed cell that is easily identifiable, either directly or indirectly, by standard techniques 
and includes, but is not limited to, genes encoding proteins that confer resistance to toxins or 
antibiotics such as ampicillin, neomycin, and methotrexate; genes encoding proteins that 
complement auxotrophic deficiencies; and genes encoding proteins that supply critical 
components not available from complex media. Examples of marker genes include green 
fluorescent protein (GFP), red fluorescent protein (DsRed), alkaline phosphatase (AP), 
β-lactamase, chloramphenicol acetyltransferase (CAT), adenosine deaminase (ADA), 
aminoglycoside phosphotransferase (neoʳ, G418ʳ), dihydrofolate reductase (DHFR), 
hygromycin-B-phosphotransferase (HPH), thymidine kinase (TK), lacZ (encoding β-galactosidase), luciferase 
(luc), and xanthine guanine phosphoribosyltransferase (XGPRT). As with many of the standard 
procedures associated with the practice of the invention, skilled artisans will be aware of 
additional sequences that can serve the function of a marker or reporter. Thus, this list is merely 
meant to show examples of what can be used and is not meant to limit the invention. 

[0041] As used herein, the term "promoter" refers to a regulatory element that regulates, 
controls, or drives expression of a nucleic acid molecule of interest and can be derived from 
sources such as from adenovirus, SV40, parvoviruses, vaccinia virus, cytomegalovirus, or 
mammalian genomic DNA. Examples of suitable promoters include, but are not limited to, 
CMV, MSH2, trp, lac, phage, and tRNA promoters. Suitable promoters that can be used in yeast 
include, but are not limited to, such constitutive promoters as 3-phosphoglycerate kinase and 
various other glycolytic enzyme gene promoters such as enolase or glyceraldehyde-3-phosphate 
dehydrogenase, or such inducible promoters as the alcohol dehydrogenase 2 promoter or 
metallothionein promoter. Again, as with many of the standard procedures associated with the 
practice of the invention, skilled artisans will be aware of additional promoters that can serve the 
function of directing the expression of a marker or reporter. Thus, the list is merely meant to 
show examples of what can be used and is not meant to limit the invention. 



[0042] "Operably linked" refers to juxtaposition wherein the components are in a functional 
relationship. For example, a promoter is operably linked or connected to a coding sequence if it 
controls the transcription or expression of the sequence. 

[0043] The terms "polypeptide," "peptide," and "protein" are used interchangeably herein. 
"Polypeptide" refers to a polymer of amino acids without referring to a specific length. 
Polypeptides of the invention include peptide fragments, derivatives, and fusion proteins. 
Peptide fragments preferably have at least about 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 
or 100 amino acids. Some peptide fragments of the invention are biologically active. Biological 
activities include immunogenicity, ligand binding, and activity associated with the reference 
peptide. Immunogenic peptides and fragments of the invention generate an epitope-specific 
immune response, wherein "epitope" refers to an immunogenic determinant of a peptide and 
preferably contains at least three, five, eight, nine, ten, fifteen, twenty, thirty, forty, forty-five, or 
fifty amino acids. Some immunogenic peptides of the invention generate an immune response 
specific to that peptide. Polypeptides of the invention include naturally occurring and non- 
naturally occurring peptides. The term includes modified polypeptides (wherein examples of 
such modifications include glycosylation, acetylation, phosphorylation, carboxylation, 
ubiquitination, labeling, etc.), analogs (such as non-naturally occurring amino acids, substituted 
linkages, etc.), and functional mimetics. A variety of methods for labeling polypeptides are well 
known in the art and include radioactive isotopes such as ³²P or ³⁵S, ligands that bind to labeled 
antiligands (e.g., antibodies), fluorophores, chemiluminescent agents, enzymes, and antiligands. 

[0044] As used herein, the term "amino acid" denotes a molecule containing both an amino 
group and a carboxyl group. In some embodiments, the amino acids are α-, β-, γ-, or δ-amino 
acids, including their stereoisomers and racemates. As used herein the term "L-amino acid" 
denotes an α-amino acid having the L configuration around the α-carbon, that is, a carboxylic 
acid of general formula CH(COOH)(NH2)-(side chain), having the L-configuration. The term 
"D-amino acid" similarly denotes a carboxylic acid of general formula CH(COOH)(NH2)-(side 
chain), having the D-configuration around the α-carbon. Side chains of L-amino acids include 
naturally occurring and non-naturally occurring moieties. Non-naturally occurring (i.e., 
unnatural) amino acid side chains are moieties that are used in place of naturally occurring amino 
acid side chains in, for example, amino acid analogs. Amino acid substituents may be attached, 
for example, through their carbonyl groups through the oxygen or carbonyl carbon thereof, or 
through their amino groups, or through functionalities residing on their side chain portions. 

[0045] The amino acid sequences are presented in the amino (N) to carboxy (C) direction, from 
left to right. The N-terminal α-amino group and the C-terminal β-carboxy groups are not 
depicted in the sequence. The nucleotide sequences are presented by single strands only, in the 
5' to 3' direction, from left to right. Nucleotides and amino acids are represented in the manner 
recommended by the IUPAC-IUB Biochemical Nomenclature Commission, or amino acids are 
represented by their three-letter code designations. 

[0046] As used herein, the term "antibody" is meant to refer to complete, intact antibodies, and 
Fab, Fab', F(ab)2, Fv, and other fragments thereof. Complete, intact antibodies include 
antibodies such as polyclonal antibodies, monoclonal antibodies, chimeric antibodies, 
humanized antibodies, felinized antibodies, and immunologic binding equivalents thereof. The 
antibodies of the invention may be labeled or unlabeled. Examples of labels of antibodies 
include, but are not limited to, radionuclides, enzymes, substrates, cofactors, inhibitors, 
fluorescent agents, chemiluminescent agents, magnetic particles, and the like. Recombinant 
immunoglobulins are included in the invention. 

[0047] As used herein, the term "binding" means the physical or chemical interaction between 
two proteins or compounds or associated proteins or compounds or combinations thereof. 
Binding includes ionic, non-ionic, hydrogen bond, van der Waals, and hydrophobic interactions, 
etc. The physical interaction, the binding, can be either direct or indirect, indirect being through 
or due to the effects of another protein or compound. Direct binding refers to interactions that do 
not take place through or due to the effect of another protein or compound but instead are 
without other substantial chemical intermediates. Binding may be detected in many different 
manners. As a non-limiting example, the physical binding interaction between two molecules 
can be detected using a labeled compound. Other methods of detecting binding are well-known 
to those of skill in the art. 

[0048] As used herein, the term "contacting" means bringing together, either directly or 
indirectly, a compound into physical proximity to a molecule of interest. Contacting may occur, 
for example, in any number of buffers, salts, solutions, or in a cell or cell extract. 

[0049] As used herein, the terms "modulates" or "modifies" mean an increase or decrease in 
the amount, quality, or effect of a particular activity or protein. "Modulators" refer to any 
inhibitory or activating molecules identified using in vitro and in vivo assays for, e.g., agonists, 
antagonists, and their homologs, including fragments, variants, and mimetics, as defined herein, 
that exert substantially the same biological activity as the molecule. "Inhibitors" or "antagonists" 
are modulating compounds that reduce, decrease, block, prevent, delay activation, inactivate, 
desensitize, or downregulate the biological activity or expression of a molecule or pathway of 
interest. "Inducers," "activators," or "agonists" are modulating compounds that increase, induce, 
stimulate, open, activate, facilitate, enhance activation, sensitize, or upregulate a molecule or 
pathway of interest. In some preferred embodiments of the invention, the level of inhibition or 
upregulation of the expression or biological activity of a molecule or pathway of interest refers to 
a decrease (inhibition or downregulation) or increase (upregulation) of greater than about 50%, 
60%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%. The 
inhibition or upregulation may be direct, i.e., operate on the molecule or pathway of interest 
itself, or indirect, i.e., operate on a molecule or pathway that affects the molecule or pathway of 
interest. 

[0050] A "purified" or "substantially purified" polynucleotide or polypeptide is substantially 
separated from other cellular components that naturally accompany a native (or wild-type) 
nucleic acid or polypeptide and/or from other impurities (e.g., agarose gel). A purified 
polypeptide or protein will comprise about 60% to more than 99% w/w of a sample, and may be 
about 90%, about 95%, or about 98% pure. As used herein, the term "isolated" refers to a 
molecule that has been removed from its native environment. Examples of isolated nucleic acid 
molecules include, but are not limited to, recombinant DNA molecules contained in a vector, 
recombinant DNA molecules maintained in a heterologous host cell, partially or substantially 
purified nucleic acid molecules, and synthetic DNA or RNA molecules. 

[0051] "About" as used herein refers to +/- 10% of the reference value. 
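
The +/- 10% convention can be written as a one-line check. This is a minimal sketch; the name `within_about` and the treatment of the tolerance as a fraction of the reference value are illustrative assumptions.

```python
def within_about(value: float, reference: float, tolerance: float = 0.10) -> bool:
    """True if value lies within +/- 10% (by default) of the reference value."""
    return abs(value - reference) <= tolerance * abs(reference)


print(within_about(95, 100))   # True
print(within_about(111, 100))  # False
```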

[0052] As used herein, "variant" nucleotide or amino acid sequences refer to homologs, 
including, for example, isoforms, species variants, allelic variants, and fragments of the sequence 
of interest. "Homologous nucleotide sequence" or "homologous amino acid sequence," or 
variations thereof, refers to sequences characterized by a homology, at the nucleotide level or 
amino acid level, of at least about 60%, at least about 70%, at least about 75%, at least about 
80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least 
about 85%, preferably at least about 90%, at least about 95%, at least about 98%, or at least 
about 99%, and more preferably 100%, to a reference sequence, or portion or fragment thereof 
encoding or having a functional domain. The reference sequence may include, for example, but 
is not limited to the nucleic acid sequence of SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, 
SEQ ID NO:62, and SEQ ID NO:63, or portions thereof which encode a functional domain of 
the encoded polypeptide, SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64, or the polypeptide 
having amino acid sequence SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64, or fragments 
thereof having functional domains of the full-length polypeptide. Functional domains of the T1R 
receptors of the invention include extracellular domains, transmembrane domains, and 
intracellular domains. Examples of functional domains of the T1R2 polypeptide of SEQ ID 
NO:61 include extracellular domains corresponding to residues 1-563, 624-635, 701-726, and 
781-792; transmembrane domains corresponding to residues 564-589, 604-623, 636-660, 681- 
700, 727-748, 761-780, and 793-817; and intracellular domains corresponding to residues 590- 
603, 661-680, 749-760, and 818-841. Examples of functional domains of the T1R2 receptor of 
SEQ ID NO:64 include an extracellular domain corresponding to residues 1-147; a 
transmembrane domain corresponding to residues 148-167; and an intracellular domain 
corresponding to residues 168-391. Examples of functional domains of the T1R3 polypeptide of 
SEQ ID NO:2 include the extracellular domains (residues 1-571, 628-641, 705-730, and 787-794 
of SEQ ID NO:2), the transmembrane domains (residues 572-594, 610-627, 642-664, 681-704, 
731-754, 767-780, and 795-812 of SEQ ID NO:2), and the intracellular domains (residues 595- 
609, 665-680, 755-766, and 813-865 of SEQ ID NO:2). Isoforms can be expressed in different 
tissues of the same organism as a result of, for example, alternative splicing of RNA. 
Alternatively, isoforms can be encoded by different genes. Homologous nucleotide sequences 
include nucleotide sequences encoding for a species variant of a protein. Homologous nucleotide 
sequences also include, but are not limited to, naturally occurring allelic variations and mutations 
of the nucleotide sequences set forth herein. Study of mutations and polymorphisms of the T1R 
receptor polynucleotide sequences may explain breed-specific and/or individual taste preferences 
of a mammal such as a cat. Additionally, sequence variants of the T1R receptors may be 
associated with specific disease states, such that knowledge of the genes allows diagnosis and 
treatment of T1R-associated disorders (e.g., obesity, diabetes). Homologous amino acid 
sequences include those amino acid sequences which contain conservative amino acid 
substitutions in polypeptides having an amino acid sequence of SEQ ID NO:2, SEQ ID NO:61, 
or SEQ ID NO:64, as well as in polypeptides identified according to the methods of the 
invention. Percent homology may be determined by, for example, the Gap program (Wisconsin 
Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research 
Park, Madison Wis.), using the default settings, which uses the algorithm of Smith and 
Waterman (Smith and Waterman, Adv. Appl. Math., 2: 482-489, 1981). Nucleic acid fragments 
of the invention preferably have at least about 5, at least about 10, at least about 15, at least about 
20, at least about 25, at least about 50, or at least about 100 nucleotides of the reference 
nucleotide sequence. The nucleic acid fragments of the invention may encode a polypeptide 
having at least one biological property, or function, that is substantially similar to a biological 
property of the polypeptide encoded by the full-length nucleic acid sequence. 
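
The residue ranges recited above for the T1R2 polypeptide of SEQ ID NO:61 can be held in a small lookup table, which makes it easy to ask which domain a given residue falls in and to confirm that the listed ranges tile the polypeptide without gaps or overlaps. The data structure and function names are illustrative assumptions; only the residue numbers come from the text.

```python
# Residue ranges (1-based, inclusive) as recited for SEQ ID NO:61.
T1R2_SEQ61_DOMAINS = {
    "extracellular": [(1, 563), (624, 635), (701, 726), (781, 792)],
    "transmembrane": [(564, 589), (604, 623), (636, 660), (681, 700),
                      (727, 748), (761, 780), (793, 817)],
    "intracellular": [(590, 603), (661, 680), (749, 760), (818, 841)],
}


def domain_of(residue: int, domains=T1R2_SEQ61_DOMAINS):
    """Return the domain type containing the residue, or None if outside all ranges."""
    for name, ranges in domains.items():
        for start, end in ranges:
            if start <= residue <= end:
                return name
    return None


def is_contiguous(domains=T1R2_SEQ61_DOMAINS) -> bool:
    """True if the sorted ranges follow one another with no gap or overlap."""
    spans = sorted(span for ranges in domains.values() for span in ranges)
    return all(prev[1] + 1 == nxt[0] for prev, nxt in zip(spans, spans[1:]))


print(domain_of(600))   # intracellular
print(is_contiguous())  # True
```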

[0053] As is well known in the art, because of the degeneracy of the genetic code, there are 
numerous DNA and RNA molecules that can code for the same polypeptide as that encoded by a 
nucleotide sequence of interest. The present invention, therefore, contemplates those other DNA 
and RNA molecules which, on expression, encode a polypeptide encoded by the nucleic acid 
molecule of interest. DNA and RNA molecules other than those specifically disclosed herein 
characterized simply by a change in a codon for a particular amino acid, are within the scope of 
this invention. 
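
The degeneracy described above can be made concrete by counting how many distinct DNA sequences encode the same short peptide. This is a sketch using the standard genetic code; the function names and the example peptide are illustrative and are not taken from the sequences of the invention.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, codons enumerated in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (5' to 3') into one-letter amino acids."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the given peptide."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide)


# Two different DNA molecules that encode the same peptide Met-Lys-Leu.
print(translate("ATGAAACTG"), translate("ATGAAGTTA"))  # MKL MKL
print(count_coding_sequences("MKL"))                   # 12
```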

[0054] Amino acid "insertions", "substitutions" or "deletions" are changes to or within an 
amino acid sequence. The variation allowed in a particular amino acid sequence may be 
experimentally determined by producing the peptide synthetically or by systematically making 
insertions, deletions, or substitutions of nucleotides in the nucleic acid sequence using 
recombinant DNA techniques. Alterations of the naturally occurring amino acid sequence can be 
accomplished by any of a number of known techniques. For example, mutations can be 
introduced into the polynucleotide encoding a polypeptide at particular locations by procedures 
well known to the skilled artisan, such as oligonucleotide-directed mutagenesis, which is 
described by U.S. Pat. Nos. 4,518,584 and 4,737,462. 

[0055] A polypeptide variant of the present invention may exhibit substantially the biological 
activity of a naturally occurring reference polypeptide. "Biological activity" as used herein refers 
to the level of a particular function (for example, enzymatic activity) of a molecule or pathway of 
interest in a biological system. "Wild-type biological activity" refers to the normal level of 
function of a molecule or pathway of interest. "Reduced biological activity" refers to a 
decreased level of function of a molecule or pathway of interest relative to a reference level of 
biological activity of that molecule or pathway. For example, reduced biological activity may 
refer to a decreased level of biological activity relative to the wild-type biological activity of a 
molecule or pathway of interest. "Increased biological activity" refers to an increased level of 
function of a molecule or pathway of interest relative to a reference level of biological activity of 
that molecule or pathway. For example, increased biological activity may refer to an increased 
level of biological activity relative to the wild-type biological activity of a molecule or pathway 
of interest. Reference to exhibiting "substantially the biological activity of a naturally occurring 
polypeptide" indicates that variants within the scope of the invention can comprise 
conservatively substituted sequences, meaning that one or more amino acid residues of a 
polypeptide are replaced by different residues that do not alter the secondary and/or tertiary 
structure of the polypeptide. Such substitutions may include the replacement of an amino acid by 
a residue having similar physicochemical properties, such as substituting one aliphatic residue 
(Ile, Val, Leu, or Ala) for another, or substitution between basic residues Lys and Arg, acidic 
residues Glu and Asp, amide residues Gln and Asn, hydroxyl residues Ser and Tyr, or aromatic 
residues Phe and Tyr. Further information regarding making phenotypically silent amino acid 
exchanges is known in the art (Bowie et al., Science, 247: 1306-1310, 1990). Other polypeptide 
homologs which might retain substantially the biological activities of the reference polypeptide 
are those where amino acid substitutions have been made in areas outside functional regions of 
the protein. The biological activity may be assessed by, for example, measuring binding of a 
T1R receptor of the invention to its heterodimerization partner. 
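
The substitution classes listed above lend themselves to a simple lookup. The sketch below encodes only the groups named in this paragraph, using three-letter residue codes; the dictionary and function names are illustrative assumptions, and membership in a shared group is a heuristic that does not by itself establish that activity is retained.

```python
# Residue groups exactly as listed in the paragraph above.
CONSERVATIVE_GROUPS = {
    "aliphatic": {"Ile", "Val", "Leu", "Ala"},
    "basic": {"Lys", "Arg"},
    "acidic": {"Glu", "Asp"},
    "amide": {"Gln", "Asn"},
    "hydroxyl": {"Ser", "Tyr"},
    "aromatic": {"Phe", "Tyr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if two different residues share at least one listed group."""
    if original == replacement:
        return False  # no substitution took place
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS.values())


print(is_conservative("Lys", "Arg"))  # True
print(is_conservative("Lys", "Glu"))  # False
```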

[0056] A nucleotide and/or amino acid sequence of a nucleic acid molecule or polypeptide 
employed in the invention or of a compound identified by the screening method of the invention 
may be used to search a nucleotide and amino acid sequence databank for regions of similarity 
using Gapped BLAST (Altschul et al., Nuc. Acids Res., 25: 3389, 1997). Briefly, the BLAST 
algorithm, which stands for Basic Local Alignment Search Tool, is suitable for determining 
sequence similarity (Altschul et al., J. Mol. Biol., 215: 403-410, 1990). Software for performing 
BLAST analyses is publicly available through the National Center for Biotechnology 
Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high-scoring 
sequence pairs (HSPs) by identifying short words of length W in the query sequence that 
either match or satisfy some positive-valued threshold score T when aligned with a word of the 
same length in a database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et al., J. Mol. Biol., 215: 403-410, 1990). These initial neighborhood word hits act as 
seeds for initiating searches to find HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be increased. 
Extension of the word hits in each direction is halted when: 1) the cumulative alignment score 
falls off by the quantity X from its maximum achieved value; 2) the cumulative score goes to 
zero or below, due to the accumulation of one or more negative-scoring residue alignments; or 3) 
the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine 
the sensitivity and speed of the alignment. The BLAST program uses as defaults a word length 
(W) of 11, the BLOSUM62 scoring matrix (Henikoff et al., Proc. Natl. Acad. Sci. USA, 89: 
10915-10919, 1992), alignments (B) of 50, expectation (E) of 10, M=5, N=4, and a comparison 
of both strands. The BLAST algorithm (Karlin et al., Proc. Natl. Acad. Sci. USA, 90: 5873-5877, 
1993) and Gapped BLAST perform a statistical analysis of the similarity between two sequences. 
One measure of similarity provided by the BLAST algorithm is the smallest sum probability 
(P(N)), which provides an indication of the probability by which a match between two nucleotide 
or amino acid sequences would occur by chance. For example, a nucleic acid is considered 
similar to a gene or cDNA if the smallest sum probability in comparison of the test nucleic acid 
to the reference nucleic acid is less than about 1, preferably less than about 0.1, more preferably 
less than about 0.01, and most preferably less than about 0.001. 
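
The three halting rules for extending a word hit can be illustrated with a simplified, ungapped, one-directional extension. This is a sketch of the idea only, not the BLAST implementation: the function name, the treatment of M and N as a match reward of 5 and a mismatch penalty of 4, the X value of 20, and the short 4-base seed are all assumptions made for illustration.

```python
def extend_right(query: str, subject: str, q_pos: int, s_pos: int,
                 seed_score: int, match: int = 5, mismatch: int = -4,
                 x_drop: int = 20):
    """Extend a seed hit to the right without gaps.

    Returns (best_score, bases_added), where best_score includes seed_score.
    """
    score = best = seed_score
    best_len = 0
    i = 0
    # Rule 3: stop when the end of either sequence is reached.
    while q_pos + i < len(query) and s_pos + i < len(subject):
        score += match if query[q_pos + i] == subject[s_pos + i] else mismatch
        i += 1
        if score > best:
            best, best_len = score, i
        elif best - score >= x_drop or score <= 0:
            # Rule 1: score fell X below its maximum; rule 2: score reached zero.
            break
    return best, best_len


query = "ACGTACGTACGGGGTTTTTTTT"
subject = "ACGTACGTACGGGGAAAAAAAA"
# A 4-base seed at position 0 scores 4 * 5 = 20; extension starts at position 4.
print(extend_right(query, subject, 4, 4, seed_score=20))  # (70, 10)
```

In this example the ten matching bases after the seed raise the score to 70, and the fifth consecutive mismatch drops it 20 below that maximum, which ends the extension.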

[0057] The term "mimetic" as used herein refers to a compound that is sterically similar to a 
reference compound. Mimetics are structural and functional equivalents to the reference 
compounds. 

[0058] The terms "patient" and "subject" are used interchangeably herein and include, but are 
not limited to, avians, felines, canines, bovines, ovines, porcines, equines, rodents, simians, and 
humans. "Host cell" includes, for example, a mammalian cell (e.g., human, rodent, feline), yeast 
cell, or plant cell. "Rodents" include, for example, rats and mice. 

[0059] The term "treatment" as used herein refers to any indicia of success of prevention, 
treatment, or amelioration of a disease or condition. Treatment includes any objective or 
subjective parameter, such as, but not limited to, abatement, remission, normalization of receptor 
activity, reduction in the number or severity of symptoms or side effects, or slowing of the rate 
of degeneration or decline of the patient. Treatment also includes a prevention of the onset of 
symptoms in a patient that may be at increased risk for or is suspected of having a disease or 
condition but does not yet experience or exhibit symptoms thereof. 

[0060] As used herein, the term "compound" means any identifiable chemical or molecule, 
including, but not limited to a small molecule, peptide, protein, sugar, nucleotide, or nucleic acid. 
Such compound can be natural or synthetic. 

Polynucleotides 

[0061] The invention provides purified and isolated polynucleotides (e.g., cDNA, genomic 
DNA, synthetic DNA, RNA, or combinations thereof, whether single- or double-stranded) that 
comprise a nucleotide sequence encoding the amino acid sequence of the polypeptides of the 
invention. Such polynucleotides are useful for recombinantly expressing the receptor and also 
for detecting expression of the receptor in cells (e.g., using Northern hybridization and in situ 
hybridization assays). Such polynucleotides also are useful in the design of antisense and other 
molecules for the suppression of the expression of a T1R receptor in a cultured cell, a tissue, or 
an animal; for therapeutic purposes; or to provide a model for diseases or conditions 
characterized by aberrant T1R expression. Specifically excluded from the definition of 
polynucleotides of the invention are entire isolated, non-recombinant native chromosomes of 
host cells. Polynucleotides of the invention include the nucleotide sequence of SEQ ID NO:1, 
SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, and SEQ ID NO:63. It will be appreciated that 
numerous other polynucleotide sequences exist that also encode the T1R receptors of the 
invention due to the well-known degeneracy of the universal genetic code. 

[0062] The invention also provides a purified and isolated polynucleotide comprising a 
nucleotide sequence that encodes a mammalian (e.g., feline) polypeptide, wherein the 
polynucleotide hybridizes to a polynucleotide having a sequence of SEQ ID NO:1, or the 
non-coding strand complementary thereto, under stringent hybridization conditions. 

[0063] Genomic DNA of the invention comprises the protein-coding region for a polypeptide 
of the invention and is also intended to include allelic variants thereof. It is widely understood 
that, for many genes, genomic DNA is transcribed into RNA transcripts that undergo one or 
more splicing events wherein introns (i.e., non-coding regions) of the transcripts are removed, or 
"spliced out." RNA transcripts that can be spliced by alternative mechanisms, and therefore be 
subject to removal of different RNA sequences but still encode a T1R3 polypeptide, are referred 
to in the art as splice variants which are embraced by the invention. Splice variants 
comprehended by the invention therefore are encoded by the same original genomic DNA 
sequences but arise from distinct mRNA transcripts. Allelic variants are modified forms of a 
wild-type gene sequence, the modification resulting from recombination during chromosomal 
segregation or exposure to conditions which give rise to genetic mutation. Allelic variants, like 
wild type genes, are naturally occurring sequences (as opposed to non-naturally occurring 
variants that arise from in vitro manipulation). 

[0064] The invention also comprehends cDNA that is obtained through reverse transcription of 
an RNA polynucleotide encoding a T1R receptor (conventionally followed by second strand 
synthesis of a complementary strand to provide a double-stranded DNA). 

[0065] One embodiment of the DNA of the invention comprises a double-stranded molecule 
along with the complementary molecule (the "non-coding strand" or "complement") having a 
sequence unambiguously deducible from the coding strand according to Watson-Crick 
base-pairing rules for DNA. 

[0066] The present invention includes fragments of nucleotide sequences encoding a T1R 
receptor comprising at least 10, and preferably at least 12, 14, 16, 18, 20, 25, 30, 35, 40, 45, 50, 
75, or 100 consecutive nucleotides of a polynucleotide encoding a T1R receptor. Fragment 
polynucleotides of the invention may comprise sequences unique to the T1R-encoding 
polynucleotide sequence, and therefore hybridize under highly stringent or moderately stringent 
conditions only (i.e., "specifically") to polynucleotides encoding a T1R receptor (or fragments 
thereof). Polynucleotide fragments of genomic sequences of the invention comprise not only 
sequences unique to the coding region, but also include fragments of the full-length sequence 
derived from introns, regulatory regions, and/or other non-translated sequences. Sequences 
unique to polynucleotides of the invention are recognizable through sequence comparison to 
other known polynucleotides, and can be identified through use of alignment programs routinely 
utilized in the art, e.g., those made available in public sequence databases. Such sequences also 
are recognizable from Southern hybridization analyses to determine the number of fragments of 
genomic DNA to which a polynucleotide will hybridize. Polynucleotides of the invention can be 
labeled in a manner that permits their detection, including radioactive, fluorescent, and 
enzymatic labeling. 

[0067] Fragment polynucleotides are particularly useful as probes for detection of full-length 
T1R polynucleotides or fragments thereof. One or more polynucleotides can be included in kits that 
are used to detect the presence of a polynucleotide encoding a T1R receptor, or used to detect 
variations in a polynucleotide sequence encoding a T1R receptor. 

[0068] The invention also embraces DNAs encoding T1R polypeptides that hybridize under 
high stringency conditions to the non-coding strand, or complement, of the polynucleotides. 

[0069] Exemplary highly stringent hybridization conditions are as follows: hybridization at 
42°C in a hybridization solution comprising 50% formamide, 1% SDS, 1 M NaCl, 10% Dextran 
sulfate, and washing twice for 30 minutes at 60°C in a wash solution comprising 0.1 X SSC and 
1% SDS. It is understood in the art that conditions of equivalent stringency can be achieved 
through variation of temperature and buffer, or salt concentration as described, for example, in 
Ausubel et al. (Eds.), Protocols in Molecular Biology, John Wiley & Sons (1994), pp. 6.0.3 
to 6.4.10. Modifications in hybridization conditions can be empirically determined or precisely 
calculated based on the length and the percentage of guanosine/cytosine (GC) base pairing of the 
probe. The hybridization conditions can be calculated as described, for example, in Sambrook et 
al. (Eds.), Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory 
Press: Cold Spring Harbor, New York (1989), pp. 9.47 to 9.51. 
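
To illustrate how probe length and GC content enter such a calculation, the sketch below uses one commonly cited empirical approximation for the melting temperature of long DNA duplexes. The formula and its coefficients are assumptions drawn from general laboratory practice rather than from this document, the function names are hypothetical, and published variants of the approximation use slightly different constants.

```python
import math


def gc_percent(probe: str) -> float:
    """Percentage of G and C bases in a probe sequence."""
    probe = probe.upper()
    return 100.0 * sum(base in "GC" for base in probe) / len(probe)


def estimate_tm(sodium_molar: float, gc_pct: float,
                formamide_pct: float, length: int) -> float:
    """Approximate Tm in degrees C for a DNA duplex (assumed empirical formula).

    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.61*(%formamide) - 500/length
    """
    return (81.5 + 16.6 * math.log10(sodium_molar) + 0.41 * gc_pct
            - 0.61 * formamide_pct - 500.0 / length)


# Hypothetical 100-nucleotide probe with 50% GC in 1 M NaCl and 50% formamide.
print(round(estimate_tm(1.0, 50.0, 50.0, 100), 1))  # 66.5
```

Under this approximation, raising the GC content or the salt concentration raises the estimated Tm, while adding formamide or shortening the probe lowers it.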

[0070] With the knowledge of the nucleotide sequence information disclosed in the present 
invention, one skilled in the art can identify and obtain nucleotide sequences which encode T1R 
receptors from different sources (i.e., different tissues or different organisms) through a variety 
of means well known to the skilled artisan and as disclosed by, for example, Sambrook et al., 
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Press, 
Cold Spring Harbor, NY (1989). 

[0071] For example, DNA that encodes a T1R receptor may be obtained by screening mRNA, 
cDNA, or genomic DNA with oligonucleotide probes generated from the T1R gene sequence 
information provided herein. Probes may be labeled with a detectable group, such as a 
fluorescent group, a radioactive atom or a chemiluminescent group in accordance with 
procedures known to the skilled artisan and used in conventional hybridization assays, as 
described by, for example, Sambrook et al. 

[0072] A nucleic acid molecule comprising a T1R nucleotide sequence can alternatively be 
synthesized by use of the polymerase chain reaction (PCR) procedure, with the PCR 
oligonucleotide primers produced from the nucleotide sequences provided herein. See U.S. 
Patent Numbers 4,683,195 to Mullis et al. and 4,683,202 to Mullis. The PCR reaction provides a 
method for selectively increasing the concentration of a particular nucleic acid sequence even 
when that sequence has not been previously purified and is present only in a single copy in a 
particular sample. The method can be used to amplify either single- or double-stranded DNA. 
The essence of the method involves the use of two oligonucleotide probes to serve as primers for 
the template-dependent, polymerase mediated replication of a desired nucleic acid molecule. 
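
The use of two primers flanking a target region can be sketched in Python as follows. This is an illustration only: the helper names, the primer length, and the example template are hypothetical, and practical primer design would also weigh melting temperature, secondary structure, and specificity.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(template: str, start: int, end: int, length: int = 18):
    """Return (forward, reverse) primers flanking template[start:end].

    The forward primer copies the 5' end of the target on the given strand;
    the reverse primer is the reverse complement of the target's 3' end, so
    that it anneals to the given strand and primes synthesis back toward the
    forward primer.
    """
    target = template[start:end].upper()
    if len(target) < 2 * length:
        raise ValueError("target too short for two non-overlapping primers")
    forward = target[:length]
    reverse = reverse_complement(target[-length:])
    return forward, reverse


def ideal_copies(cycles: int, starting_copies: int = 1) -> int:
    """Copies after a number of cycles, assuming perfect doubling each cycle."""
    return starting_copies * 2 ** cycles


# Hypothetical template used only for illustration; not a T1R sequence.
template = "ATGGCCTTAGGCATCGATTGACCA"
fwd, rev = design_primers(template, 0, len(template), length=6)
print(fwd, rev, ideal_copies(30))
```

The doubling assumption in `ideal_copies` is an idealization; real reactions plateau as reagents are consumed.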

[0073] A wide variety of alternative cloning and in vitro amplification methodologies are well 
known to those skilled in the art. Examples of these techniques are found in, for example, 
Berger et al., Guide to Molecular Cloning Techniques, METHODS IN ENZYMOLOGY 152, 
Academic Press, Inc., San Diego, CA (Berger), which is incorporated herein by reference in its 
entirety. 

[0074] The polynucleotides of the invention may be used in hybridization techniques known to 
those skilled in the art, including but not limited to, Northern and Southern blotting and overgo 
hybridization (see infra). For example, polynucleotide probes of the invention may be used in 
tissue distribution studies and diagnostic assays. The T1R receptors of the invention are likely to 
be present and active in tissues other than those involved in taste perception. It is therefore likely 
that the feline T1R receptors serve multiple functions in vivo, such as, for example, regulation of 
amino acid metabolism in addition to taste perception. 

[0075] Automated sequencing methods can be used to obtain or verify the T1R receptor- 
encoding nucleotide sequence. The nucleotide sequences of the present invention are believed to 
be accurate. However, as is known in the art, nucleotide sequences obtained by automated 
methods may contain some errors. Nucleotide sequences determined by automation are typically 
at least about 90%, more typically at least about 95% to at least about 99.9% identical to the 
actual nucleotide sequence of a given nucleic acid molecule. The actual sequence may be more 
precisely determined using manual sequencing methods, which are well known in the art. An 
error in a sequence which results in an insertion or deletion of one or more nucleotides may 
result in a frame shift in translation such that the predicted amino acid sequence will differ from 
that which would be predicted from the actual nucleotide sequence of the nucleic acid molecule, 
starting at the point of the mutation. 
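
The effect of such an insertion error on the predicted amino acid sequence can be illustrated with a short Python sketch. The sequences are hypothetical and are not taken from the sequence listing; the codon table is the standard genetic code, and the function names are assumptions made for the example.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def translate(dna: str) -> str:
    """Translate in frame 1; '*' marks a stop codon, trailing bases are ignored."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def first_difference(a: str, b: str):
    """Index of the first position at which two strings differ, or None."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    return None if len(a) == len(b) else min(len(a), len(b))


actual = "ATGGCTGAAACCTGA"                   # hypothetical true sequence
miscalled = actual[:3] + "G" + actual[3:]    # one spurious base after codon 1

print(translate(actual))
print(translate(miscalled))
print(first_difference(translate(actual), translate(miscalled)))
```

Every residue downstream of the inserted base is read in a shifted frame, which is why the two predicted peptides diverge from the point of the error onward.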

[0076] The nucleic acid molecules of the present invention, and fragments derived therefrom, 
are useful for screening for restriction fragment length polymorphism (RFLP) associated with 
certain disorders, as well as for genetic mapping. 
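
How a sequence polymorphism gives rise to a restriction fragment length polymorphism can be sketched as follows. The two alleles are hypothetical sequences constructed for illustration, and EcoRI (recognition site GAATTC, cut after the first G) is chosen only as a familiar example enzyme; the function name and parameters are likewise assumptions.

```python
def fragment_lengths(seq: str, site: str = "GAATTC", cut_offset: int = 1):
    """Lengths of the fragments produced by cutting a linear sequence at each site.

    cut_offset is the position of the cut within the recognition site on the
    given strand (1 for EcoRI, which cuts G^AATTC).
    """
    seq = seq.upper()
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Hypothetical alleles: allele_b carries a single-base change (A to G) that
# destroys the second recognition site.
allele_a = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAATTC" + "TT"
allele_b = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAGTTC" + "TT"

print(fragment_lengths(allele_a))
print(fragment_lengths(allele_b))
```

The loss of one site merges two fragments into a longer one, and it is this difference in fragment pattern that a hybridization probe derived from the sequence would detect.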

[0077] The polynucleotide sequence information provided by the invention makes possible 
large-scale expression of the encoded polypeptide by techniques well known and routinely 
practiced in the art. 

Vectors 

[0078] Another aspect of the present invention is directed to vectors, or recombinant 
expression vectors, comprising any of the nucleic acid molecules described above. Vectors are 
used herein either to amplify DNA or RNA encoding a T1R receptor and/or to express DNA 
which encodes a T1R receptor. Examples of vectors include, but are not limited to, plasmids, 
phages, cosmids, episomes, viral particles or viruses, and integratable DNA fragments (i.e., 
fragments integratable into the host genome by homologous recombination). Examples of viral 
particles include, but are not limited to, adenoviruses, baculoviruses, parvoviruses, herpesviruses, 
poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia viruses, and retroviruses. 
Examples of expression vectors include, but are not limited to, pcDNA3 (Invitrogen) and pSVL 
(Pharmacia Biotech). Other expression vectors include, but are not limited to, pSPORT™ 
vectors, pGEM™ vectors (Promega), pPROEX™ vectors (LTI, Bethesda, MD), Bluescript™ 
vectors (Stratagene), pQE™ vectors (Qiagen), pSE420™ (Invitrogen), and 
pYES2™ (Invitrogen). 

[0079] Expression constructs may comprise T1R-encoding polynucleotides operably linked to 
an endogenous or exogenous expression control DNA sequence and a transcription terminator. 
Expression control DNA sequences include promoters, enhancers, operators, and regulatory 
element binding sites generally, and are typically selected based on the expression systems in 
which the expression construct is to be utilized. Promoter and enhancer sequences are generally 
selected for the ability to increase gene expression, while operator sequences are generally 
selected for the ability to regulate gene expression. Expression constructs of the invention may 
also include sequences encoding one or more selectable markers that permit identification of host 
cells bearing the construct. Expression constructs may also include sequences that facilitate, or 
promote, homologous recombination in a host cell. Constructs of the invention also may include 
sequences necessary for replication in a host cell. 

[0080] Expression constructs may be utilized for production of an encoded protein, but may 
also be utilized simply to amplify a T1R-encoding polynucleotide sequence. In some 
embodiments, the vector is an expression vector wherein a polynucleotide of the invention is 
operably linked to a polynucleotide comprising an expression control sequence. Autonomously 
replicating recombinant expression constructs such as plasmid and viral DNA vectors 
incorporating polynucleotides of the invention are also provided. Some expression vectors are 
replicable DNA constructs in which a DNA sequence encoding a T1R receptor is operably linked 
or connected to suitable control sequence(s) capable of effecting the expression of the receptor in 
a suitable host. Amplification vectors do not require expression control domains, but rather need 
only the ability to replicate in a host, such as conferred by an origin of replication, and a 
selection gene to facilitate recognition of transformants. The need for control sequences in the 
expression vector will vary depending upon the host selected and the transformation method 
chosen. Control sequences include a transcriptional promoter, an optional operator sequence to 
control transcription, a sequence encoding suitable mRNA ribosomal binding, and sequences 
which control the termination of transcription and translation. 

[0081] Vectors of the invention may contain a promoter that is recognized by the host 
organism. The promoter sequences of the present invention may be prokaryotic, eukaryotic, or 
viral. Examples of suitable prokaryotic sequences include the PR and PL promoters of 
bacteriophage lambda (The Bacteriophage Lambda, Hershey, A. D., Ed., Cold Spring Harbor 
Press, Cold Spring Harbor, NY (1973), which is incorporated herein by reference in its entirety; 
Lambda II, Hendrix, R. W., Ed., Cold Spring Harbor Press, Cold Spring Harbor, NY (1980), 
which is incorporated herein by reference in its entirety), the trp, recA, heat shock, and lacZ 
promoters of E. coli, and the SV40 early promoter (Benoist et al., Nature, 1981, 290, 304-310), 
which is incorporated herein by reference in its entirety. Additional promoters include, but are 
not limited to, mouse mammary tumor virus, long terminal repeat of human immunodeficiency 
virus, Moloney virus, cytomegalovirus immediate early promoter, Epstein Barr virus, Rous 
sarcoma virus, human actin, human myosin, human hemoglobin, human muscle creatine, and 
human metallothionein. 

[0082] Additional regulatory sequences can also be included in vectors of the invention. 
Examples of suitable regulatory sequences are represented by the Shine-Dalgarno sequence of the 
replicase gene of the phage MS-2 and of the gene cII of bacteriophage lambda. The 
Shine-Dalgarno sequence may be directly followed by DNA encoding a T1R3 receptor, resulting in the 
expression of the mature protein. 

[0083] Moreover, suitable expression vectors can include an appropriate marker that allows the 
screening of transformed host cells. The transformation of the selected host is carried out using 
any one of the various techniques well known to the expert in the art and described in Sambrook 
et al., supra. 

[0084] An origin of replication or autonomously replicating sequence (ARS) can be 
provided either by construction of the vector to include an exogenous origin or 
by the host cell chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter may be sufficient. Alternatively, rather than using vectors which contain 
viral origins of replication, one skilled in the art can transform mammalian cells by the method 
of co-transformation with a selectable marker and T1R DNA. An example of a suitable marker 
is dihydrofolate reductase (DHFR) or thymidine kinase (see, U.S. Patent No. 4,399,216). 

[0085] Additional regulatory sequences that may be included in the polynucleotides of the 
invention include secretion signals which allow the encoded polypeptide to cross and/or lodge in 
cell membranes, or be secreted from the cell. 

[0086] Nucleotide sequences encoding a T1R receptor may be recombined with vector DNA in 
accordance with conventional techniques, including blunt-ended or staggered-ended termini for 
ligation, restriction enzyme digestion to provide appropriate termini, filling in of cohesive ends 
as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and ligation with 
appropriate ligases. Techniques for such manipulation are disclosed by Sambrook et al., supra, 
and are well known in the art. Methods for construction of mammalian expression vectors are 
disclosed in, for example, Okayama et al., Mol. Cell. Biol., 1983, 3, 280; Cosman et al., Mol. 
Immunol., 1986, 23, 935; Cosman et al., Nature, 1984, 312, 768; EP-A-0367566; and WO 
91/18982, each of which is incorporated herein by reference in its entirety. 

Host cells 

[0087] According to another aspect of the invention, host cells are provided, including 
prokaryotic and eukaryotic cells, comprising a polynucleotide of the invention (or vector of the 
invention) in a manner that permits expression of the encoded T1R polypeptide. Polynucleotides 
of the invention may be introduced into the host cell as part of a circular plasmid, or as linear 
DNA comprising an isolated protein-coding region or a viral vector. Methods for introducing 
DNA into the host cell that are well known and routinely practiced in the art include 
transformation, transfection, electroporation, nuclear injection, or fusion with carriers such as 
liposomes, micelles, ghost cells, and protoplasts. Expression systems of the invention include 
bacterial, yeast, fungal, plant, insect, invertebrate, vertebrate, and mammalian cell systems. 

[0088] The invention provides host cells that are transformed or transfected (stably or 
transiently) with polynucleotides of the invention or vectors of the invention. As stated above, 
such host cells are useful for amplifying the polynucleotides and also for expressing a T1R 
polypeptide or fragment thereof encoded by the polynucleotide. 

[0089] In still another related embodiment, the invention provides a method for producing a 
T1R polypeptide (or fragment thereof) comprising the steps of growing a host cell of the 
invention in a nutrient medium and isolating the polypeptide or variant thereof from the cell or 
the medium. Because the T1R receptor is a membrane-spanning polypeptide, it will be 
appreciated that, for some applications, such as certain activity assays, the preferable isolation 
may involve isolation of cell membranes containing the polypeptide embedded therein, whereas 
for other applications a more complete isolation may be preferable. 

[0090] According to some aspects of the present invention, transformed host cells having an 
expression vector comprising any of the nucleic acid molecules described above are provided. 
Expression of the nucleotide sequence occurs when the expression vector is introduced into an 
appropriate host cell. Suitable host cells for expression of the polypeptides of the invention 
include, but are not limited to, prokaryotes, yeast, and eukaryotes. If a prokaryotic expression 
vector is employed, then the appropriate host cell would be any prokaryotic cell capable of 
expressing the cloned sequences. Suitable prokaryotic cells include, but are not limited to, 
bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas, Streptomyces, and 
Staphylococcus. 

[0091] If a eukaryotic expression vector is employed, then the appropriate host cell would be 
any eukaryotic cell capable of expressing the cloned sequence. Eukaryotic cells may be cells of 
higher eukaryotes. Suitable eukaryotic cells include, but are not limited to, non-human 
mammalian tissue culture cells and human tissue culture cells. Host cells include, but are not 
limited to, insect cells, HeLa cells, Chinese hamster ovary cells (CHO cells), African green 
monkey kidney cells (COS cells), human HEK-293 cells, and murine 3T3 fibroblasts. 
Propagation of such cells in cell culture has become a routine procedure (see, Tissue Culture, 
Academic Press, Kruse and Patterson, eds. (1973), which is incorporated herein by reference in 
its entirety). 

[0092] In addition, a yeast host may be employed as a host cell. Yeast cells include, but are 
not limited to, the genera Saccharomyces, Pichia, and Kluyveromyces. Yeast hosts may be S. 
cerevisiae and P. pastoris. Yeast vectors may contain an origin of replication sequence from a 
2μ yeast plasmid, an autonomously replicating sequence (ARS), a promoter region, sequences 
for polyadenylation, sequences for transcription termination, and a selectable marker gene. 
Shuttle vectors for replication in both yeast and E. coli are also included herein. 

[0093] Alternatively, insect cells may be used as host cells. In some embodiments, the 
polypeptides of the invention are expressed using a baculovirus expression system (see, Luckow 
et al., Bio/Technology, 1988, 6, 47; Baculovirus Expression Vectors: A Laboratory 
Manual, O'Reilly et al. (Eds.), W.H. Freeman and Company, New York, 1992; and U.S. Patent 
No. 4,879,236, each of which is incorporated herein by reference in its entirety). In addition, the 
MAXBAC™ complete baculovirus expression system (Invitrogen) can, for example, be used for 
production in insect cells. 

[0094] Host cells of the invention are a valuable source of immunogen for development of 
antibodies specifically immunoreactive with the T1R3 receptor. Host cells of the invention also 
are useful in methods for the large-scale production of T1R3 polypeptides wherein the cells are 
grown in a suitable culture medium and the desired polypeptide products are isolated from the 
cells, or from the medium in which the cells are grown, by purification methods known in the art, 
e.g., conventional chromatographic methods including immunoaffinity chromatography, receptor 
affinity chromatography, hydrophobic interaction chromatography, lectin affinity 
chromatography, size exclusion filtration, cation or anion exchange chromatography, high 
pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still other methods 
of purification include those methods wherein the desired protein is expressed and purified as a 
fusion protein having a specific tag, label, or chelating moiety that is recognized by a specific 
binding partner or agent. The purified protein can be cleaved to yield the desired protein, or can 
be left as an intact fusion protein. Cleavage of the fusion component may produce a form of the 
desired protein having additional amino acid residues as a result of the cleavage process. 

[0095] Knowledge of the feline T1R receptor-encoding nucleotide sequence allows for 
modification of cells to permit, or increase, expression of endogenous receptor. Cells can be 
modified (e.g., by homologous recombination) to provide increased expression by replacing, in 
whole or in part, the naturally occurring T1R promoter with all or part of a heterologous 
promoter so that the cells express the receptor at higher or lower levels. The heterologous 
promoter is inserted in such a manner that it is operably linked to endogenous T1R coding 
sequence. (See, for example, PCT International Publication No. WO 94/12650, PCT 
International Publication No. WO 92/20808, and PCT International Publication No. 
WO 91/09955.) It is also contemplated that, in addition to heterologous promoter DNA, 
amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes 
carbamoyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron 
DNA may be inserted along with the heterologous promoter DNA. If linked to the T1R coding 
sequence, amplification of the marker DNA by standard selection methods results in 
co-amplification of the T1R coding sequences in the cells. 

Knock-out and transplacement animals 

[0096] The DNA sequence information provided by the present invention also makes possible 
the development (e.g., by homologous recombination strategies; see Capecchi, Science 
244:1288-1292 (1989), which is incorporated herein by reference) of transgenic or gene-targeted 
animals, including, for example, animals that fail to express functional T1R3 ("knock-out") or 
that express a variant thereof ("transplacement"). Such animals (especially small laboratory 
animals such as rats, rabbits, mice, and cats) are useful as models for studying the in vivo 
activities of T1R receptors and modulators of T1R receptors. 

Antisense and siRNA 

[0097] Also encompassed by the invention are antisense and short interfering polynucleotides 
that recognize and hybridize to polynucleotides encoding T1R receptors. Full-length and 
fragment antisense polynucleotides are provided. Fragment antisense molecules of the invention 
include (i) those that specifically recognize and hybridize to T1R RNA (as determined by 
sequence comparison of DNA encoding T1R receptor to DNA encoding other known 
molecules). Identification of sequences unique to T1R-encoding polynucleotides can be deduced 
through use of any publicly available sequence database, and/or through use of commercially 
available sequence comparison programs. After identification of the desired sequences, isolation 
through restriction digestion or amplification using any of the various polymerase chain reaction 
techniques well known in the art can be performed. Antisense polynucleotides are particularly 
relevant to regulation of expression of T1R receptor by those cells expressing T1R mRNA. 

[0098] Antisense nucleic acids (preferably 10 to 30 base-pair oligonucleotides) capable of 
specifically binding to T1R expression control sequences or T1R RNA are introduced into cells 
(e.g., by a viral vector or colloidal dispersion system such as a liposome). The antisense nucleic 
acid binds to the target nucleotide sequence in the cell and prevents transcription and/or 
translation of the target sequence. Phosphorothioate and methylphosphonate antisense 
oligonucleotides are specifically contemplated for therapeutic use by the invention. Locked 
nucleic acids are also specifically contemplated for therapeutic use by the present invention. 
(See, for example, Wahlestedt et al., Proc. Natl. Acad. Sci. USA, 97(10), 5633-5638 (2000), 
which is incorporated by reference in its entirety.) The antisense oligonucleotides may be further 
modified by adding poly-L-lysine, transferrin polylysine, or cholesterol moieties at their 5' end. 
Suppression of T1R expression at either the transcriptional or translational level is useful to 
generate cellular or animal models for diseases/conditions characterized by aberrant T1R 
expression. 

[0099] Antisense oligonucleotides, or fragments of nucleotide sequence of SEQ ID NO:1, SEQ 
ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63, or sequences complementary or 
homologous thereto, derived from the nucleotide sequences of the present invention encoding 
T1R receptors are useful as diagnostic tools for probing gene expression in various tissues. For 
example, tissue can be probed in situ with oligonucleotide probes carrying detectable groups by 
conventional autoradiography techniques to investigate native expression of this receptor or 
pathological conditions relating thereto. Antisense oligonucleotides may be directed to 
regulatory regions of a T1R nucleotide sequence, or mRNA corresponding thereto, including, but 
not limited to, the initiation codon, TATA box, enhancer sequences, and the like. 

[0100] Those of skill in the art recognize that the antisense oligonucleotides that inhibit the 
expression and/or biological activity of a T1R receptor may be predicted using any gene 
encoding a T1R receptor. Specifically, antisense nucleic acid molecules comprise a sequence 
preferably complementary to at least about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 100, 250 or 500 
nucleotides or an entire T1R receptor gene sequence. The antisense oligonucleotides may 
comprise a sequence complementary to about 15 consecutive nucleotides of the coding strand of 
the T1R receptor-encoding sequence. 

[0101] In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence encoding a T1R protein. The coding strand may 
also include regulatory regions of the T1R sequence. The term "coding region" refers to the 
region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence encoding a T1R protein. The 
term "noncoding region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions (UTR)). 

[0102] Antisense oligonucleotides may be directed to regulatory regions of a nucleotide 
sequence encoding a T1R protein, or mRNA corresponding thereto, including, but not limited to, 
the initiation codon, TATA box, enhancer sequences, and the like. Given the coding strand 
sequences provided herein, antisense nucleic acids of the invention can be designed according to 
the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule 
can be complementary to the entire coding region of a T1R mRNA, but also may be an 
oligonucleotide that is antisense to only a portion of the coding or noncoding region of the 
mRNA. For example, the antisense oligonucleotide can be complementary to the region 
surrounding the translation start site of an mRNA. An antisense oligonucleotide can be, for 
example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. 
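
Designing an antisense oligonucleotide against the region surrounding a translation start site by Watson-Crick complementarity can be sketched as below. The coding-strand sequence is hypothetical, the function and parameter names are assumptions, and treating the first ATG found as the start codon is a simplification made only for the example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def antisense_around_start(coding_strand: str, length: int = 20,
                           upstream: int = 5) -> str:
    """Antisense oligo (written 5' to 3') covering the first ATG.

    The window begins `upstream` bases before the ATG (or at the start of the
    sequence if fewer are available) and spans `length` bases of the coding
    strand; the oligo is the reverse complement of that window.
    """
    seq = coding_strand.upper()
    atg = seq.find("ATG")
    if atg == -1:
        raise ValueError("no ATG found in the supplied sequence")
    begin = max(0, atg - upstream)
    window = seq[begin:begin + length]
    if len(window) < length:
        raise ValueError("sequence too short for the requested oligo length")
    return reverse_complement(window)


# Hypothetical coding-strand fragment used only for illustration.
coding = "GGCACCATGGCTGAAACCTGACTGGATCCA"
oligo = antisense_around_start(coding, length=20, upstream=5)
print(oligo)
```

Because the oligonucleotide is the reverse complement of the coding strand, it is complementary to the corresponding mRNA, in which T is replaced by U.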

[0103] Another means to inhibit the activity of a T1R receptor according to the invention is via 
RNA interference (RNAi) (see e.g., Elbashir et al., Nature, 411:494-498 (2001); Elbashir et al., 
Genes & Development, 15:188-200 (2001)). RNAi is the process of sequence-specific, 
post-transcriptional gene silencing, initiated by double-stranded RNA (dsRNA) that is homologous in 
sequence to the silenced gene (e.g., is homologous in sequence to the sequence encoding a T1R 
receptor, for example but not limited to the sequence as set forth in SEQ ID NO:1, SEQ ID 
NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63). siRNA-mediated silencing is 
thought to occur post-transcriptionally and/or transcriptionally. For example, siRNA duplexes 
may mediate post-transcriptional gene silencing by reconstitution of siRNA-protein complexes 
(siRNPs), which guide mRNA recognition and targeted cleavage. 

[0104] Accordingly, another form of a T1R inhibitory compound of the invention is a short 
interfering RNA (siRNA) directed against a T1R-encoding sequence. Exemplary siRNAs are 
siRNA duplexes (for example, 10-25, preferably 20, 21, 22, 23, 24, or 25 residues in length) 
having a sequence homologous or identical to a fragment of the T1R sequence set forth as SEQ 
ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63 and having a 
symmetric 2-nucleotide 3'-overhang. The 2-nucleotide 3' overhang may be composed of 
(2'-deoxy)thymidine because it reduces costs of RNA synthesis and may enhance nuclease 
resistance of siRNAs in the cell culture medium and within transfected cells. Substitution of 
uridine by thymidine in the 3' overhang is also well tolerated in mammalian cells, and the 
sequence of the overhang appears not to contribute to target recognition. 
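
The construction of such a duplex from a target sequence can be sketched in Python. The 19-nucleotide paired core, the "dTdT" notation for the two 2'-deoxythymidine overhang residues, the function name, and the example target are all assumptions chosen for illustration; they are not prescribed by this disclosure.

```python
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def sirna_duplex(target_dna: str, start: int, core_length: int = 19):
    """Return (sense, antisense) strands, each written 5' to 3'.

    The sense core is the RNA version of the chosen target window; the
    antisense core is its reverse complement. Each strand carries a
    two-residue 2'-deoxythymidine 3' overhang, written here as 'dTdT'.
    """
    core_dna = target_dna[start:start + core_length].upper()
    if len(core_dna) != core_length or set(core_dna) - set("ACGT"):
        raise ValueError("target window must be core_length bases of A, C, G, T")
    sense_core = core_dna.replace("T", "U")
    antisense_core = sense_core.translate(RNA_COMPLEMENT)[::-1]
    overhang = "dTdT"
    return sense_core + overhang, antisense_core + overhang


# Hypothetical target fragment used only for illustration.
target = "ATGGCTGAAACCTGACTGGATCCA"
sense, antisense = sirna_duplex(target, 0)
print("sense     5'-" + sense + "-3'")
print("antisense 5'-" + antisense + "-3'")
```

With a 19-nucleotide core, each strand is 21 residues long, which falls within the preferred length range stated above.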

Polypeptides 

[0105] The invention also provides purified and isolated mammalian T1R receptor 
polypeptides encoded by a polynucleotide of the invention. Some embodiments include a feline 
T1R polypeptide comprising the amino acid sequence of SEQ ID NO:2, SEQ ID NO:61, or SEQ 
ID NO:64, or fragments thereof comprising an epitope specific to the polypeptide. A reference 
to "epitope specific to" or "polypeptide-specific epitope," or variations thereof, indicates that a 
portion of the T1R receptor or amino acid sequence is recognizable by an antibody that is 
specific for the T1R or amino acid sequence. 

[0106] Included within the scope of the invention are polypeptides encoded by feline allelic 
variants of T1R. The allelic variants of the T1R receptor of the invention may modify the taste 
perception of a mammal, such as a cat, to a taste stimulus. Such functional amino acid sequence 
modifications may account for differences in intraspecies (e.g., breed-specific) taste perception. 

[0107] Extracellular epitopes are useful for generating and screening for antibodies and other 
binding compounds that bind to a T1R receptor. Thus, in another embodiment, the invention 
provides a purified and isolated polypeptide comprising at least one extracellular domain of the 
T1R receptor. Also included is a polypeptide comprising a T1R receptor fragment selected from 
the group consisting of an extracellular domain of T1R3 (residues 1-571, 628-641, 705-730, and 
787-794 of SEQ ID NO:2), a transmembrane domain of T1R3 (residues 572-594, 610-627, 
642-664, 681-704, 731-754, 767-780, and 795-812 of SEQ ID NO:2), an intracellular domain of 
T1R3 (residues 595-609, 665-680, 755-766, and 813-865 of SEQ ID NO:2), an extracellular 
domain of the T1R1 receptor (residues 1-563, 624-635, 701-726, and 781-792 of SEQ ID 
NO:61), a transmembrane domain of the T1R1 receptor (residues 564-589, 604-623, 636-660, 
681-700, 727-748, 761-780, and 793-817 of SEQ ID NO:61), an intracellular domain of the 
T1R1 receptor (residues 590-603, 661-680, 749-760, and 818-841 of SEQ ID NO:61), an 
extracellular domain of T1R2 (residues 1-147 of SEQ ID NO:64), a transmembrane domain of a 
T1R2 receptor (residues 148-167 of SEQ ID NO:64), and an intracellular domain of a T1R2 
receptor (residues 168-391 of SEQ ID NO:64). Polypeptide fragments of the invention may be 
continuous portions of the native receptor. However, it will also be appreciated that knowledge 
of the T1R genes and protein sequences as provided herein permits recombination of various 
domains that are not contiguous in the native protein. 
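
The residue ranges recited above for T1R3 (SEQ ID NO:2) can be held in a simple lookup structure, sketched here in Python. The ranges are copied from the paragraph above; the dictionary layout and function names are assumptions made for the example, and a residue that falls in none of the recited ranges is reported as unassigned rather than guessed.

```python
# Residue ranges (1-based, inclusive) for T1R3 as recited for SEQ ID NO:2.
T1R3_DOMAINS = {
    "extracellular": [(1, 571), (628, 641), (705, 730), (787, 794)],
    "transmembrane": [(572, 594), (610, 627), (642, 664), (681, 704),
                      (731, 754), (767, 780), (795, 812)],
    "intracellular": [(595, 609), (665, 680), (755, 766), (813, 865)],
}


def domain_of(residue: int, domains=T1R3_DOMAINS):
    """Return the domain type containing the residue, or None if unassigned."""
    for name, ranges in domains.items():
        if any(lo <= residue <= hi for lo, hi in ranges):
            return name
    return None


def extract_segments(sequence: str, name: str, domains=T1R3_DOMAINS):
    """Slice every segment of the named domain type out of a full-length sequence."""
    return [sequence[lo - 1:hi] for lo, hi in domains[name]]


for residue in (1, 580, 600, 700, 865):
    print(residue, domain_of(residue))
```

Given the full-length amino acid sequence as a string, `extract_segments(sequence, "extracellular")` would return the four extracellular segments, which is one way to assemble the noncontiguous domain combinations mentioned above.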

[0108] The invention embraces polypeptides that preferably have at least 99%, at least 95%, at 
least 90%, at least 85%, at least 80%, at least 75%, at least 74%, at least 73%, at least 72%, at 
least 71%, at least 70%, at least 65%, at least 60%, at least 55% or at least 50% identity and/or 
homology to the polypeptides of the invention. 
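
Percent identity between two sequences that have already been aligned can be computed as in the following Python sketch. The convention used (identical positions divided by aligned columns, skipping columns in which both sequences have a gap) is only one of several in common use and is an assumption here, as are the function names and the example sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pairwise alignment.

    Columns where both sequences have a gap are ignored; a residue aligned
    to a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the two aligned sequences share at least `threshold` percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical aligned fragments used only for illustration.
print(percent_identity("MA-TLKQ", "MAETLRQ"))
print(meets_threshold("MA-TLKQ", "MAETLRQ", 70.0))
```

Producing the alignment itself is a separate step, typically carried out with a sequence comparison program, and the resulting percentage depends on the alignment parameters chosen.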

[0109] Polypeptides of the invention may be isolated from natural cell sources or may be 
chemically synthesized, but are preferably produced by recombinant procedures involving host 
cells of the invention. Use of mammalian host cells is expected to provide for such 
post-translational modifications (e.g., glycosylation, truncation, lipidation, and phosphorylation) 
as may be needed to confer optimal biological activity on recombinant expression products of the 
invention. 

[0110] The invention also embraces variant T1R polypeptides. In one example, insertion 
variants are provided wherein one or more amino acid residues supplement a T1R amino acid 
sequence such as SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64. Insertions may be located 
at either or both termini of the protein, or may be positioned within internal regions of the amino 
acid sequence. Insertional variants with additional residues at either or both termini can include, 
for example, fusion proteins and proteins including amino acid tags or labels. 

[0111] Insertion variants include T1R polypeptides wherein one or more amino acid residues 
are added to a biologically active fragment thereof. For example, the insertion variants of the 
invention include chimeric T1R receptors wherein at least one functional domain of a feline T1R 
receptor of the invention is present. 

[0112] The invention also embraces T1R variants having additional amino acid residues that 
result from use of specific expression systems. For example, use of commercially available 
vectors that express a desired polypeptide as part of a glutathione-S-transferase (GST) fusion 
product provides the desired polypeptide having an additional glycine residue at position -1 after 
cleavage of the GST component from the desired polypeptide. Variants that result from 
expression in other vector systems are also contemplated. 

[0113] In another aspect, the invention provides deletion variants wherein one or more amino 
acid residues in a T1R polypeptide are removed. Deletions can be effected at one or both termini 
of the T1R polypeptide, or with removal of one or more non-terminal amino acid residues of 
T1R. Deletion variants, therefore, include all fragments of a T1R polypeptide. 

[0114] The invention also embraces polypeptide fragments that maintain biological (e.g., 
ligand binding, heterodimerization) and/or immunological properties of a T1R polypeptide. 

[0115] As used in the present invention, polypeptide fragments preferably comprise at least 10, 
15, 20, 25, 30, 35, 40, 45, or 50 consecutive amino acids of SEQ ID NO:2, SEQ ID NO:61, or 
SEQ ID NO:64. Some polypeptide fragments display antigenic properties unique to, or specific 
for, a feline T1R receptor. Fragments of the invention having the desired biological and 
immunological properties can be prepared by any of the methods well known and routinely 
practiced in the art. 

[0116] In still another aspect, the invention provides substitution variants of T1R polypeptides. 
Substitution variants include those polypeptides wherein one or more amino acid residues of a 
T1R polypeptide are removed and replaced with alternative residues. In one aspect, the 
substitutions are conservative in nature; however, the invention embraces substitutions that are 
also non-conservative. Conservative substitutions for this purpose may be defined as set out in 
Tables 1, 2, or 3 below. 

[0117] Variant polypeptides include those wherein conservative substitutions have been 
introduced by modification of polynucleotides encoding polypeptides of the invention. Amino 
acids can be classified according to physical properties and contribution to secondary and tertiary 
protein structure. A conservative substitution is recognized in the art as a substitution of one 
amino acid for another amino acid that has similar properties. Exemplary conservative 
substitutions are set out in Table 1 (from WO 97/09433, page 10, published March 13, 1997 
(PCT/GB96/02197, filed 9/6/96)), immediately below. 
Table 1 

Conservative Substitutions I 

SIDE CHAIN CHARACTERISTIC        AMINO ACID 

Aliphatic 
    Non-polar                    G A P 
                                 I L V 
    Polar - uncharged            C S T M 
                                 N Q 
    Polar - charged              D E 
                                 K R 
Aromatic                         H F W Y 
Other                            N Q D E 

Alternatively, conservative amino acids can be grouped as described in Lehninger, 
[Biochemistry, Second Edition; Worth Publishers, Inc. NY, NY (1975), pp.71-77] as set out in 
Table 2, below. 
Table 2 

Conservative Substitutions II 

SIDE CHAIN CHARACTERISTIC        AMINO ACID 

Non-polar (hydrophobic) 
    A. Aliphatic:                A L I V P 
    B. Aromatic:                 F W 
    C. Sulfur-containing:        M 
    D. Borderline:               G 
Uncharged-polar 
    A. Hydroxyl:                 S T Y 
    B. Amides:                   N Q 
    C. Sulfhydryl:               C 
    D. Borderline:               G 
Positively Charged (Basic):      K R H 
Negatively Charged (Acidic):     D E 

As still another alternative, exemplary conservative substitutions are set out in Table 3, below. 
Table 3 

Conservative Substitutions III 

Original Residue        Exemplary Substitution 

Ala (A)                 Val, Leu, Ile 
Arg (R)                 Lys, Gln, Asn 
Asn (N)                 Gln, His, Lys, Arg 
Asp (D)                 Glu 
Cys (C)                 Ser 
Gln (Q)                 Asn 
Glu (E)                 Asp 
His (H)                 Asn, Gln, Lys, Arg 
Ile (I)                 Leu, Val, Met, Ala, Phe 
Leu (L)                 Ile, Val, Met, Ala, Phe 
Lys (K)                 Arg, Gln, Asn 
Met (M)                 Leu, Phe, Ile 
Phe (F)                 Leu, Val, Ile, Ala 
Pro (P)                 Gly 
Ser (S)                 Thr 
Thr (T)                 Ser 
Trp (W)                 Tyr 
Tyr (Y)                 Trp, Phe, Thr, Ser 
Val (V)                 Ile, Leu, Met, Phe, Ala 
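
By way of non-limiting illustration, Table 3 can be read as a directional lookup from an original residue to its exemplary substitutions. The Python sketch below transcribes the table into one-letter codes; the names `TABLE_3`, `is_exemplary_substitution`, and `non_table_substitutions` are hypothetical, and the equal-length comparison assumes a variant that differs from the reference by substitutions only (no insertions or deletions).

```python
# Exemplary substitutions transcribed from Table 3 (one-letter codes).
TABLE_3 = {
    "A": "VLI", "R": "KQN", "N": "QHKR", "D": "E", "C": "S",
    "Q": "N", "E": "D", "H": "NQKR", "I": "LVMAF", "L": "IVMAF",
    "K": "RQN", "M": "LFI", "F": "LVIA", "P": "G", "S": "T",
    "T": "S", "W": "Y", "Y": "WFTS", "V": "ILMFA",
}


def is_exemplary_substitution(original, replacement):
    """True if Table 3 lists `replacement` as a substitution for `original`."""
    return len(replacement) == 1 and replacement in TABLE_3.get(original, "")


def non_table_substitutions(reference, variant):
    """List (position, original, replacement) changes not found in Table 3."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    return [
        (i + 1, ref, var)  # positions are reported 1-based
        for i, (ref, var) in enumerate(zip(reference, variant))
        if ref != var and not is_exemplary_substitution(ref, var)
    ]
```

Because the table is directional, the lookup is not symmetric: Gln is listed as a substitution for Arg, but Arg is not listed as a substitution for Gln.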



[0118] It should be understood that the definition of polypeptides of the invention is intended 
to include polypeptides bearing modifications other than insertion, deletion, or substitution of 
amino acid residues. By way of example, the modifications may be covalent in nature, and 
include for example, chemical bonding with polymers, lipids, other organic, and inorganic 
moieties. Such derivatives may be prepared to increase circulating half-life of a polypeptide, or 
may be designed to improve the targeting capacity of the polypeptide for desired cells, tissues, or 
organs. Similarly, the invention further embraces T1R polypeptides that have been covalently 
modified to include one or more water-soluble polymer attachments such as polyethylene glycol, 
polyoxyethylene glycol, or polypropylene glycol. Variants that display ligand binding properties 
of native T1R and are expressed at higher levels, as well as variants that provide for 
constitutively active receptors, are particularly useful in assays of the invention; the variants are 
also useful in providing cellular, tissue and animal models of diseases/conditions characterized 
by aberrant T1R activity. 

[0119] In a related embodiment, the present invention provides compositions comprising 
purified polypeptides of the invention. Some compositions comprise, in addition to the 
polypeptide of the invention, a pharmaceutically acceptable (i.e., sterile and non-toxic) liquid, 
semisolid, or solid diluent that serves as a pharmaceutical vehicle, excipient, or medium. Any 
diluent known in the art may be used. Exemplary diluents include, but are not limited to, water, 
saline solutions, polyoxyethylene sorbitan monolaurate, magnesium stearate, methyl- and 
propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, dextrose, sorbitol, mannitol, 
glycerol, calcium phosphate, mineral oil, and cocoa butter. 

[0120] Variants that display ligand-binding properties of native T1R and are expressed at 
higher levels, as well as variants that provide for constitutively active receptors, are particularly 
useful in assays of the invention; the variants are also useful in providing cellular, tissue, and 
animal models of diseases/conditions characterized by aberrant T1R activity. 

Antibodies 

[0121] Also included in the present invention are antibodies (e.g., monoclonal and polyclonal 
antibodies, single chain antibodies, chimeric antibodies, bifunctional/bispecific antibodies, 
humanized antibodies, human antibodies, felinized antibodies, feline antibodies, and 
complementarity determining region (CDR)-grafted antibodies, including compounds which 
include CDR sequences which specifically recognize a polypeptide of the invention) specific for 
a T1R receptor of the invention or fragments thereof. Antibody fragments, including Fab, Fab', 
F(ab')2, and Fv, are also provided by the invention. The term "specific for," when used to 
describe antibodies of the invention, indicates that the variable regions of the antibodies of the 
invention recognize and bind T1R polypeptides, preferably exclusively (i.e., are able to 
distinguish T1R polypeptides of the invention from other known polypeptides by virtue of 
measurable differences in binding affinity, despite the possible existence of localized sequence 
identity, homology, or similarity between T1R and such polypeptides). It will be understood that 
specific antibodies may also interact with other proteins (for example, S. aureus protein A or 
other antibodies in ELISA techniques) through interactions with sequences outside the variable 
region of the antibodies, and, in particular, in the constant region of the molecule. Screening 
assays to determine binding specificity of an antibody of the invention are well known and 
routinely practiced in the art. For a comprehensive discussion of such assays, see Harlow et al. 
(Eds.), Antibodies: A Laboratory Manual; Cold Spring Harbor Laboratory; Cold Spring 
Harbor, NY (1988), Chapter 6. Antibodies that recognize and bind fragments of the T1R 
polypeptides of the invention are also contemplated, provided that the antibodies are specific for 
T1R polypeptides. Antibodies of the invention can be produced using any method well known 
and routinely practiced in the art. 

[0122] The invention provides an antibody that is specific for the feline T1R receptors of the 
invention. Antibodies that can be generated from polypeptides that have previously been 
described in the literature and that are capable of fortuitously cross-reacting with feline T1R 
receptor (e.g., due to the fortuitous existence of a similar epitope in both polypeptides) are 
considered "cross-reactive" antibodies. Such cross-reactive antibodies are not antibodies that are 
"specific" for a feline T1R receptor. The determination of whether an antibody is specific for a 
feline T1R receptor or is cross-reactive with another known receptor is made using any of 
several assays, such as Western blotting assays, that are well known in the art. For identifying 
cells that express a T1R receptor and also for modulating T1R-ligand binding activity, antibodies 
that specifically bind to an extracellular epitope of the T1R receptor may be used. 

[0123] In some variations, the invention provides monoclonal antibodies. Hybridomas that 
produce such antibodies also are intended as aspects of the invention. In yet another variation, 
the invention provides a felinized antibody. Felinized antibodies are useful for in vivo 
therapeutic indications. 

[0124] In another variation, the invention provides a cell-free composition comprising 
polyclonal antibodies, wherein at least one of the antibodies is an antibody of the invention 
specific for T1R receptor. Antiserum isolated from an animal is an exemplary composition, as is a 
composition comprising an antibody fraction of an antiserum that has been resuspended in water or 
in another diluent, excipient, or carrier. 

[0125] In still another related embodiment, the invention provides an anti-idiotypic antibody 
specific for an antibody that is specific for T1R receptor of the invention. 

[0126] It is well known that antibodies contain relatively small antigen binding domains that 
can be isolated chemically or by recombinant techniques. Such domains are useful T1R receptor 
binding molecules themselves, and also may be reintroduced into other antibodies or fused to 
toxins or other polypeptides. Thus, in still another embodiment, the invention provides a 
polypeptide comprising a fragment of a T1R-specific antibody, wherein the fragment and the 
polypeptide bind to the T1R receptor. By way of non-limiting example, the invention provides 
polypeptides that are single chain antibodies and CDR-grafted antibodies. 

[0127] Non-feline antibodies may be felinized by any of the methods known in the art. In one 
method, the non-feline CDRs are inserted into a feline antibody or consensus antibody 
framework sequence. Similarly, non-human antibodies may be humanized by methods known in 
the art. In one embodiment, non-human CDRs are inserted into a human antibody or consensus 
antibody framework sequence. Further changes can then be introduced into the antibody 
framework to modulate affinity or immunogenicity. 

[0128] Antibodies of the invention are useful for, e.g., therapeutic purposes (such as by 
modulating activity of T1R receptor), diagnostic purposes (such as detecting or quantitating T1R 
receptor activity), and also for purification of T1R receptor. Kits comprising an antibody of the 
invention for any of the purposes described herein are also included within the scope of the 
invention. In general, a kit of the invention preferably includes a control antigen for which the 
antibody is immunospecific. 

Compositions 

[0129] Mutations in the T1R gene that result in loss of normal function of the T1R gene 
product underlie some T1R-related disease states. The invention comprehends gene and peptide 
therapy, for example, to restore T1R activity to treat those disease states. Delivery of a 
functional T1R gene to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, 
and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or 
ex vivo by use of physical DNA transfer methods (e.g., liposomes or chemical treatments). See, 
for example, Anderson, Nature, supplement to vol. 392, No. 6679, pp. 25-30 (1998). For 
additional reviews of gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); 
Verma, Scientific American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). 
Alternatively, it is contemplated that in other disease states, preventing the expression of, or 
inhibiting the activity of, T1R receptor will be useful in treatment. It is contemplated that 
antisense therapy or gene therapy could be applied to negatively regulate the expression of T1R 
receptor. 

[0130] Another aspect of the present invention is directed to compositions, including 
pharmaceutical compositions, comprising any of the nucleic acid molecules or recombinant 
expression vectors described above and an acceptable carrier or diluent. The carrier or diluent 
may be pharmaceutically acceptable. Suitable carriers are described in the most recent edition of 
Remington's Pharmaceutical Sciences, A. Osol, a standard reference text in this field, which is 
incorporated herein by reference in its entirety. Examples of such carriers or diluents include, 
but are not limited to, water, saline, Ringer's solution, dextrose solution, and 5% serum albumin. 
Liposomes and nonaqueous vehicles such as fixed oils may also be used. The formulations may 
be sterilized by commonly used techniques. 

[0131] Also within the scope of the invention are compositions comprising polypeptides, 
polynucleotides, or antibodies of the invention that have been formulated with, e.g., a 
pharmaceutically acceptable carrier. 

[0132] The invention also provides methods of using antibodies of the invention. For example, 
the invention provides a method for modulating ligand-binding of a T1R receptor comprising the 
step of contacting the receptor with an antibody specific for the T1R polypeptide, under 
conditions wherein the antibody binds the receptor. 

Methods of identifying ligands and modulators 

[0133] The invention also provides assays to identify compounds that bind and/or modulate 
T1R receptor. A "T1R binding partner" is a compound that directly or indirectly binds a T1R 
polypeptide of the invention. One assay of the invention comprises the steps of: (a) contacting 
T1R receptor with a compound suspected of binding T1R receptor (the test compound); and (b) 
measuring binding between the compound and the T1R receptor. In one variation, the 
composition comprises a cell expressing T1R receptor on its surface. In another variation, 
isolated T1R receptor or cell membranes comprising T1R receptor are employed. The binding 
may be measured directly, e.g., by using a labeled compound, or may be measured indirectly. 
Compounds identified as binding T1R receptor may be further tested in other assays including, 
but not limited to, T1R activity assays and/or in vivo models, in order to confirm or quantitate 
their activity. 

[0134] Specific binding molecules, including natural ligands and synthetic compounds, can be 
identified or developed using isolated or recombinant T1R products, T1R variants, or preferably, 
cells expressing such products. Binding partners are useful for purifying T1R products and 
detection or quantification of T1R products in fluid and tissue samples using known 
immunological procedures. Binding molecules are also manifestly useful in modulating (i.e., 
blocking, inhibiting or stimulating) biological activities of T1R, especially those activities 
involved in signal transduction. 

[0135] The DNA and amino acid sequence information provided by the present invention also 
makes possible identification of binding partner compounds with which a T1R polypeptide or 
polynucleotide will interact. Methods to identify binding partner compounds include solution 
assays, in vitro assays wherein T1R polypeptides are immobilized, and cell-based assays. 
Identification of binding partner compounds of T1R polypeptides provides candidates for 
therapeutic or prophylactic intervention in pathologies associated with normal and aberrant 
T1R biological activity. 

[0136] The invention includes several assay systems for identifying T1R-binding partners. In 
solution assays, methods of the invention comprise the steps of (a) contacting a T1R polypeptide 
with one or more candidate binding partner compounds and (b) identifying the compounds that 
bind to the T1R polypeptide. Identification of the compounds that bind the T1R polypeptide can 
be achieved by isolating the T1R polypeptide/binding partner complex, and separating the 
binding partner compound from the T1R polypeptide. An additional step of characterizing the 
physical, biological, and/or biochemical properties of the binding partner compound is also 
comprehended in another embodiment of the invention. In one aspect, the T1R 
polypeptide/binding partner complex is isolated using an antibody immunospecific for either the 
T1R polypeptide or the candidate binding partner compound. 

[0137] In still other embodiments, either the T1R polypeptide or the candidate binding partner 
compound comprises a label or tag that facilitates its isolation, and methods of the invention to 
identify binding partner compounds include a step of isolating the T1R polypeptide/binding 
partner complex through interaction with the label or tag. An exemplary tag of this type is a 
poly-histidine sequence, generally around six histidine residues, that permits isolation of a 
compound so labeled using nickel chelation. Other labels and tags, such as the FLAG® tag 
(Eastman Kodak, Rochester, NY), well known and routinely used in the art, are embraced by the 
invention. 

[0138] In one variation of an in vitro assay, the invention provides a method comprising the 
steps of (a) contacting an immobilized T1R polypeptide with a candidate binding partner 
compound and (b) detecting binding of the candidate compound to the T1R polypeptide. In an 
alternative embodiment, the candidate binding partner compound is immobilized and binding of 
T1R receptor is detected. Immobilization is accomplished using any of the methods well known 
in the art, including covalent bonding to a support, a bead, or a chromatographic resin, as well as 
non-covalent, high affinity interactions such as antibody binding, or use of streptavidin/biotin 
binding wherein the immobilized compound includes a biotin moiety. The support may, for 
example, be formulated into a feline-specific electronic tongue. Detection of binding can be 
accomplished (i) using a radioactive label on the compound that is not immobilized, (ii) using a 
fluorescent label on the non-immobilized compound, (iii) using an antibody immunospecific for 
the non-immobilized compound, or (iv) using a label on the non-immobilized compound that 
excites a fluorescent support to which the immobilized compound is attached, as well as other 
techniques well known and routinely practiced in the art. 

[0139] The invention also provides cell-based assays to identify binding partner compounds of 
a T1R polypeptide. In one embodiment, the invention provides a method comprising the steps of 
contacting a T1R polypeptide expressed on the surface of a cell with a candidate binding partner 
compound and detecting binding of the candidate binding partner compound to the T1R 
polypeptide. In some embodiments, the detection comprises detecting a physiological event in the 
cell caused by the binding of the molecule. 

[0140] Another aspect of the present invention is directed to methods of identifying 
compounds that bind to either T1R receptor or nucleic acid molecules encoding T1R receptor, 
comprising contacting T1R receptor, or a nucleic acid molecule encoding the same, with a 
compound, and determining whether the compound binds T1R receptor or a nucleic acid 
molecule encoding the same. Binding can be determined by binding assays which are well 
known to the skilled artisan, including, but not limited to, gel-shift assays, Western blots, 
radiolabeled competition assay, phage-based expression cloning, co-fractionation by 
chromatography, co-precipitation, cross-linking, interaction trap/two-hybrid analysis, 
southwestern analysis, ELISA, and the like, which are described in, for example, Current 
Protocols in Molecular Biology, 1999, John Wiley & Sons, NY, which is incorporated 
herein by reference in its entirety. The compounds to be screened (which may include 
compounds suspected of binding T1R receptor, or a nucleic acid molecule encoding the 
same) include, but are not limited to, compounds of extracellular, intracellular, biological, or 
chemical origin. The methods of the invention also embrace ligands, especially neuropeptides, 
that are attached to a label, such as a radiolabel (e.g., 125I, 35S, 32P, 33P, 3H), a fluorescence 
label, a chemiluminescent label, an enzymic label, or an immunogenic label. Modulators falling 
within the scope of the 
invention include, but are not limited to, non-peptide molecules such as non-peptide mimetics, 
non-peptide allosteric effectors, and peptides. The T1R polypeptide or polynucleotide employed 
in such a test may either be free in solution, attached to a solid support, borne on a cell surface or 
located intracellularly, or associated with a portion of a cell. One skilled in the art can, for 
example, measure the formation of complexes between T1R receptor and the compound being 
tested. Alternatively, one skilled in the art can examine the diminution in complex formation 
between T1R receptor and its substrate caused by the compound being tested. In some 
embodiments of the invention, the recognition sites of the T1R receptor are coupled with a 
monitoring system, either electrical or optical. An appropriate chemical stimulus can bind to the 
receptor's ligand binding domain, changing the receptor conformation to a degree that the 
coupled electrical or optical changes can be observed on a read-out. Such a device could be 
developed into a feline-specific electronic tongue, for example. 

[0141] In another embodiment of the invention, high throughput screening for compounds 
having suitable binding affinity to T1R receptor is employed. Briefly, large numbers of different 
small peptide test compounds are synthesized on a solid substrate. The peptide test compounds 
are contacted with T1R receptor and washed. Bound T1R receptor is then detected by methods 
well known in the art. Purified polypeptides of the invention can also be coated directly onto 
plates for use in the aforementioned drug screening techniques. In addition, non-neutralizing 
antibodies can be used to capture the protein and immobilize it on the solid support. 

[0142] Generally, an expressed T1R receptor can be used for HTS binding assays in 
conjunction with a ligand, such as an amino acid or carbohydrate. The identified peptide is 
labeled with a suitable radioisotope, including, but not limited to, 125I, 3H, 35S, or 32P, by methods 
that are well known to those skilled in the art. Alternatively, the peptides may be labeled by 
well-known methods with a suitable fluorescent derivative (Baindur et al., Drug Dev. Res., 1994, 
33, 373-398; Rogers, Drug Discovery Today, 1997, 2, 156-160). Radioactive ligand specifically 
bound to the receptor in membrane preparations made from the cell line expressing the 
recombinant protein can be detected in HTS assays in one of several standard ways, including 
filtration of the receptor-ligand complex to separate bound ligand from unbound ligand 
(Williams, Med. Res. Rev., 1991, 11, 147-184; Sweetnam et al., J. Natural Products, 1993, 56, 
441-455). Alternative methods include a scintillation proximity assay (SPA) or a FlashPlate 
format in which such separation is unnecessary (Nakayama, Cur. Opinion Drug Disc. Dev., 
1998, 1, 85-91; Bosse et al., J. Biomolecular Screening, 1998, 3, 285-292). Binding of 
fluorescent ligands can be detected in various ways, including fluorescence resonance energy transfer 
(FRET), direct spectrophotofluorometric analysis of bound ligand, or fluorescence polarization 
(Rogers, Drug Discovery Today, 1997, 2, 156-160; Hill, Cur. Opinion Drug Disc. Dev., 1998, 1, 
92-97). 
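
By way of non-limiting illustration of the fluorescence polarization read-out mentioned above, polarization is conventionally computed from the emission intensities measured parallel and perpendicular to the plane of the polarized excitation light. The Python sketch below uses that standard definition; the function name `polarization`, the optional instrument correction `g_factor`, and all intensity values are hypothetical and are not taken from the cited references.

```python
def polarization(i_parallel, i_perpendicular, g_factor=1.0):
    """Fluorescence polarization P = (I_par - G*I_perp) / (I_par + G*I_perp)."""
    corrected = g_factor * i_perpendicular
    total = i_parallel + corrected
    if total <= 0:
        raise ValueError("total intensity must be positive")
    return (i_parallel - corrected) / total


# Hypothetical intensities: a free fluorescent ligand tumbles quickly and
# depolarizes the emission; a receptor-bound ligand tumbles more slowly.
p_free = polarization(110.0, 100.0)
p_bound = polarization(300.0, 100.0)
```

In such a sketch, an increase in polarization on addition of receptor would be read as binding of the fluorescent ligand, without any separation of bound from unbound ligand.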

[0143] Other assays may be used to identify specific ligands of a T1R receptor, including 
assays that identify ligands of the target protein through measuring direct binding of test ligands 
to the target protein, as well as assays that identify ligands of target proteins through affinity 
ultrafiltration with ion spray mass spectroscopy/HPLC methods or other physical and analytical 
methods. Alternatively, such binding interactions are evaluated indirectly using the yeast two-hybrid 
system described in Fields et al., Nature, 340:245-246 (1989), and Fields et al., Trends in 
Genetics, 10:286-292 (1994), both of which are incorporated herein by reference. The two-hybrid 
system is a genetic assay for detecting interactions between two proteins or polypeptides. 
It can be used to identify proteins that bind to a known protein of interest, or to delineate 
domains or residues critical for an interaction. Variations on this methodology have been 
developed to clone genes that encode DNA binding proteins, to identify peptides that bind to a 
protein, and to screen for drugs. The two-hybrid system exploits the ability of a pair of 
interacting proteins to bring a transcription activation domain into close proximity with a DNA 
binding domain that binds to an upstream activation sequence (UAS) of a reporter gene, and is 
generally performed in yeast. The assay requires the construction of two hybrid genes encoding 
(1) a DNA-binding domain that is fused to a first protein and (2) an activation domain fused to a 
second protein. The DNA-binding domain targets the first hybrid protein to the UAS of the 
reporter gene; however, because most proteins lack an activation domain, this DNA-binding 
hybrid protein does not activate transcription of the reporter gene. The second hybrid protein, 
which contains the activation domain, cannot by itself activate expression of the reporter gene 
because it does not bind the UAS. However, when both hybrid proteins are present, the 
noncovalent interaction of the first and second proteins tethers the activation domain to the UAS, 
activating transcription of the reporter gene. For example, when the first protein is a receptor, or 
fragment thereof, that is known to interact with another protein or nucleic acid, this assay can be 
used to detect agents that interfere with the binding interaction. Expression of the reporter gene 
is monitored as different test agents are added to the system. The presence of an inhibitory agent 
results in lack of a reporter signal. 
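
By way of non-limiting illustration, the logic of the two-hybrid read-out described above can be summarized as a truth table: the reporter is expressed only when the DNA-binding hybrid sits on the UAS, the second hybrid carries an activation domain, and the two proteins remain in contact. The Python sketch below is a deliberately simplified model; the function name `reporter_active` and the assumption that an inhibitory agent abolishes the interaction completely are hypothetical.

```python
def reporter_active(bait_binds_uas, prey_has_activation_domain,
                    proteins_interact, inhibitor_present=False):
    """Toy truth-table model of reporter expression in a two-hybrid assay."""
    # The activation domain reaches the UAS only through the contact between
    # the first and second proteins, so an agent that disrupts that contact
    # (modeled here as all-or-nothing) silences the reporter.
    tethered = proteins_interact and not inhibitor_present
    return bait_binds_uas and prey_has_activation_domain and tethered
```

For example, `reporter_active(True, True, True)` returns `True`, whereas the same call with `inhibitor_present=True` returns `False`, mirroring the loss of reporter signal described above.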

[0144] The yeast two-hybrid assay can also be used to identify proteins that bind to the gene 
product. In an assay to identify proteins that bind to a T1R receptor, or fragment thereof, a 
fusion polynucleotide encoding both a T1R receptor (or fragment) and a UAS binding domain 
(i.e., a first protein) may be used. In addition, a large number of hybrid genes each encoding a 
different second protein fused to an activation domain are produced and screened in the assay. 
Typically, the second protein is encoded by one or more members of a total cDNA or genomic 
DNA fusion library, with each second protein-coding region being fused to the activation 
domain. This system is applicable to a wide variety of proteins, and it is not necessary to know 
the identity or function of the second binding protein. The system is highly sensitive and can 
detect interactions not revealed by other methods; even transient interactions may trigger 
transcription to produce a stable mRNA that can be repeatedly translated to yield the reporter 
protein. 

[0145] Other assays may be used to search for agents that bind to the target protein. One such 
screening method to identify direct binding of test ligands to a target protein is described in U.S. 
Patent No. 5,585,277, incorporated herein by reference. This method relies on the principle that 
proteins generally exist as a mixture of folded and unfolded states, and continually alternate 
between the two states. When a test ligand binds to the folded form of a target protein (i.e., 
when the test ligand is a ligand of the target protein), the target protein molecule bound by the 
ligand remains in its folded state. Thus, the folded target protein is present to a greater extent in 
the presence of a test ligand which binds the target protein, than in the absence of a ligand. 
Binding of the ligand to the target protein can be determined by any method that distinguishes 
between the folded and unfolded states of the target protein. The function of the target protein 
need not be known in order for this assay to be performed. Virtually any agent can be assessed 
by this method as a test ligand, including, but not limited to, metals, polypeptides, proteins, 
lipids, polysaccharides, polynucleotides and small organic molecules. 
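
By way of non-limiting illustration, the principle that ligand binding enriches the folded state can be sketched with a simple two-state equilibrium. The model below is an assumption introduced for illustration and is not taken from U.S. Patent No. 5,585,277: the protein interconverts between unfolded and folded forms with equilibrium constant `k_fold` = [folded]/[unfolded], and the ligand binds only the folded form with dissociation constant `k_d`, at a free ligand concentration `ligand_conc` expressed in the same units as `k_d`. The function name and all numerical values are hypothetical.

```python
def folded_fraction(k_fold, ligand_conc=0.0, k_d=1.0):
    """Fraction of protein in the folded state (free or ligand-bound)."""
    if k_fold < 0 or ligand_conc < 0 or k_d <= 0:
        raise ValueError("k_fold and ligand_conc must be >= 0 and k_d > 0")
    # Binding to the folded form scales the apparent folding constant
    # by (1 + [L]/Kd).
    apparent = k_fold * (1.0 + ligand_conc / k_d)
    return apparent / (1.0 + apparent)


# Hypothetical values: equal folded and unfolded populations without ligand,
# then free ligand at nine times its dissociation constant.
without_ligand = folded_fraction(1.0)
with_ligand = folded_fraction(1.0, ligand_conc=9.0, k_d=1.0)
```

Under these assumptions the folded fraction rises from one half to ten elevenths, which is the kind of shift that any method distinguishing folded from unfolded protein could in principle detect.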

[0146] Another method for identifying ligands of a target protein is described in Wieboldt et 
al., Anal. Chem., 69:1683-1691 (1997), incorporated herein by reference. This technique screens 
combinatorial libraries of 20-30 agents at a time in solution phase for binding to the target 
protein. Agents that bind to the target protein are separated from other library components by 
simple membrane washing. The specifically selected molecules that are retained on the filter are 
subsequently liberated from the target protein and analyzed by HPLC and pneumatically assisted 
electrospray (ion spray) ionization mass spectroscopy. This procedure selects library 
components with the greatest affinity for the target protein, and is particularly useful for small 
molecule libraries. 

[0147] Other embodiments of the invention comprise using competitive screening assays in 
which neutralizing antibodies capable of binding a polypeptide of the invention specifically 
compete with a test compound for binding to the polypeptide. In this manner, the antibodies can 
be used to detect the presence of any peptide that shares one or more antigenic determinants with 
T1R receptor. Radiolabeled competitive binding studies are described in A.H. Lin et al., 
Antimicrobial Agents and Chemotherapy, 1997, 41(10): 2127-2131, the disclosure of which is 
incorporated herein by reference in its entirety. 

[0148] Another aspect of the present invention is directed to methods of identifying 
compounds that modulate (i.e., increase or decrease) activity of T1R receptor comprising 
contacting T1R receptor with a compound, and determining whether the compound modifies 
activity of T1R receptor. The activity in the presence of the test compound is compared to the 
activity in the absence of the test compound. Where the activity of the sample containing the test 
compound is higher than the activity in the sample lacking the test compound, the compound is 
an agonist. Similarly, where the activity of the sample containing the test compound is lower 
than the activity in the sample lacking the test compound, the compound is an antagonist. 
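
By way of non-limiting illustration, the comparison described above reduces to the sign of the difference between the two activity measurements. In the Python sketch below, the function name `classify_compound` and the optional `tolerance` (a margin below which a difference is treated as no effect) are hypothetical; the text above does not specify such a threshold.

```python
def classify_compound(activity_with, activity_without, tolerance=0.0):
    """Label a test compound from receptor activity with and without it."""
    difference = activity_with - activity_without
    if difference > tolerance:
        return "agonist"      # activity is higher with the compound
    if difference < -tolerance:
        return "antagonist"   # activity is lower with the compound
    return "no effect"        # difference falls within the tolerance
```

A non-zero `tolerance` could be chosen from the variability of replicate control measurements, so that measurement noise alone is not read as modulation.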

[0149] Agents that modulate (i.e., increase, decrease, or block) T1R receptor activity or 
expression also may be identified, for example, by incubating a putative modulator with a cell 
containing a T1R polypeptide or polynucleotide and determining the effect of the putative 
modulator on T1R receptor activity or expression. The selectivity of a compound that modulates 
the activity of T1R receptor can be evaluated by comparing its effects on T1R receptor to its 
effect on other T1R receptors. Selective modulators may include, for example, antibodies and 
other proteins, peptides, or organic molecules that specifically bind to a T1R polypeptide or a 
T1R receptor-encoding nucleic acid. Modulators of T1R receptor activity will be therapeutically 
useful in treatment of diseases and physiological conditions in which normal or aberrant T1R 
receptor activity is involved. Compounds identified as modulating T1R receptor activity may be 
further tested in other assays including, but not limited to, in vivo models, in order to confirm or 
quantitate their activity. 

[0150] The invention also provides methods for identifying a T1R receptor modulator by: (a) 
contacting a T1R receptor binding partner and a composition comprising a T1R receptor in the 
presence and in the absence of a putative modulator compound; (b) detecting binding between 
the binding partner and the T1R receptor; and (c) identifying a putative modulator compound or 
a modulator compound in view of decreased or increased binding between the binding partner 
and the T1R receptor in the presence of the putative modulator, as compared to binding in the 
absence of the putative modulator. Compounds identified as modulators of binding between 
T1R receptor and a T1R binding partner may be further tested in other assays including, but not 
limited to, in vivo models, in order to confirm or quantitate their activity. 

[0151] The invention also includes within its scope high-throughput screening (HTS) assays to 
identify compounds that interact with, enhance, or inhibit biological activity (i.e., affect 
enzymatic activity, binding activity, etc.) of a T1R polypeptide. HTS assays permit screening of 
large numbers of compounds in an efficient manner. Cell-based HTS systems are contemplated 
to investigate T1R receptor-ligand interaction. HTS assays are designed to identify "hits" or 
"lead compounds" having the desired property, from which modifications can be designed to 
improve the desired property. Chemical modification of the "hit" or "lead compound" is often 
based on an identifiable structure/activity relationship between the "hit" and the T1R3 
polypeptide. 

[0152] For example, modulators of T1R receptor activity may be identified by expressing the 
T1R receptor in a heterologous cultured mammalian cell line, such as HEK cells, and detecting 
receptor activity in the presence and absence of a test compound by monitoring changes in 
intracellular calcium using a calcium-specific intracellular dye. In another embodiment, this 
process may be automated using a high-throughput screening device. 
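
The comparison described above can be sketched in a few lines. This is a minimal illustration only, assuming the dye signal for each well is summarized as a baseline and a peak fluorescence value; the function names, the relative-change normalization, and the numeric readings are hypothetical and are not taken from the specification.

```python
def relative_response(baseline: float, peak: float) -> float:
    """Return the relative fluorescence change, (peak - baseline) / baseline."""
    if baseline <= 0:
        raise ValueError("baseline fluorescence must be positive")
    return (peak - baseline) / baseline


def modulation_ratio(control: tuple[float, float], treated: tuple[float, float]) -> float:
    """Ratio of the response with the test compound to the response without it.

    Each argument is a (baseline, peak) pair. A ratio above 1 suggests
    enhancement of receptor activity; a ratio below 1 suggests inhibition.
    """
    control_response = relative_response(*control)
    if control_response == 0:
        raise ValueError("control well shows no response")
    return relative_response(*treated) / control_response


# Hypothetical readings in arbitrary fluorescence units (assumed values).
without_compound = (100.0, 120.0)
with_compound = (100.0, 150.0)
ratio = modulation_ratio(without_compound, with_compound)
print(f"response ratio (with / without compound): {ratio:.2f}")
```

In an automated setting the same calculation would simply be repeated per well; thresholds for calling a compound a modulator are left to the assay designer.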

[0153] Candidate modulators contemplated by the invention include compounds selected from 
libraries of either potential activators or potential inhibitors. There are a number of different 
libraries used for the identification of small molecule modulators, including: (1) chemical 
libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of random 
peptides, oligonucleotides, or organic molecules. Chemical libraries consist of random chemical 
structures, some of which are analogs of known compounds or analogs of compounds that have 
been identified as "hits" or "leads" in other drug discovery screens, some of which are derived 
from natural products, and some of which arise from non-directed synthetic organic chemistry. 
Natural product libraries are collections of microorganisms, animals, plants, or marine organisms 
that are used to create mixtures for screening by: (1) fermentation and extraction of broths from 
soil, plant, or marine microorganisms or (2) extraction of plants or marine organisms. Natural 
product libraries include polyketides, non-ribosomal peptides, and variants (non-naturally 
occurring) thereof. For a review, see Science 282:63-68 (1998). Combinatorial libraries are 
composed of large numbers of peptides, oligonucleotides, or organic compounds as a mixture. 
These libraries are relatively easy to prepare by traditional automated synthesis methods, PCR, 
cloning, or proprietary synthetic methods. Of particular interest are non-peptide combinatorial 
libraries. Still other libraries of interest include peptide, protein, peptidomimetic, multiparallel 
synthetic collection, recombinatorial, and polypeptide libraries. For a review of combinatorial 
chemistry and libraries created therefrom, see Myers, Curr. Opin. Biotechnol. 8:701-707 (1997). 
Identification of modulators through use of the various libraries described herein permits 
modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to modulate 
activity. 

[0154] T1R receptor binding partners that stimulate T1R receptor activity are useful as agonists 
in disease states or conditions characterized by insufficient T1R receptor signaling (e.g., as a 
result of insufficient activity of a T1R receptor ligand). T1R receptor binding partners that block 
ligand-mediated T1R receptor signaling are useful as T1R receptor antagonists to treat disease 
states or conditions characterized by excessive T1R receptor signaling. Thus, in another aspect, 
the invention provides methods for treating a disease or abnormal condition by administering to a 
patient in need of such treatment a substance that modulates the activity or expression of a 
polypeptide having a sequence of SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64, or 
exhibiting substantially the same biological activity as a polypeptide having a sequence of SEQ 
ID NO:2, SEQ ID NO:61, or SEQ ID NO:64. 

[0155] In addition, T1R receptor modulators in general, as well as T1R receptor-encoding 
polynucleotides and polypeptides, are useful in diagnostic assays for such diseases or conditions. 

Mimetics 

[0156] Mimetics or mimics of compounds identified herein (sterically similar compounds 
formulated to mimic the key portions of the structure) may be designed for pharmaceutical use. 
Mimetics may be used in the same manner as the compounds identified by the present invention 
that modulate the T1R receptor and hence are also functional equivalents. The generation of a 
structural-functional equivalent may be achieved by the techniques of modeling and chemical 
design known to those of skill in the art. It will be understood that all such sterically similar 
constructs fall within the scope of the present invention. 

[0157] The design of mimetics to a known pharmaceutically active compound is a known 
approach to the development of pharmaceuticals based on a "lead" compound. This is desirable 
where, for example, the active compound is difficult or expensive to synthesize, or where it is 
unsuitable for a particular method of administration, e.g., some peptides may be unsuitable active 
agents for oral compositions as they tend to be quickly degraded by proteases in the alimentary 
canal. 

[0158] There are several steps commonly taken in the design of a mimetic. First, the particular 
parts of the compound that are critical and/or important in determining its T1R-modulating 
properties are determined. In the case of a polypeptide, this can be done by systematically 
varying the amino acid residues in the peptide, e.g. by substituting each residue in turn. Alanine 
scans of peptides are commonly used to refine such peptide motifs. 

[0159] Once the active region of the compound has been identified, its structure is modeled 
according to its physical properties, e.g. stereochemistry, bonding, size, and/or charge, using data 
from a range of sources, such as, but not limited to, spectroscopic techniques, X-ray diffraction 
data, and NMR. Computational analysis, similarity mapping (which models the charge and/or 
volume of the active region, rather than the bonding between atoms), and other techniques 
known to those of skill in the art can be used in this modeling process. 

[0160] In a variant of this approach, the three-dimensional structure of the compound that 
modulates a T1R receptor and the active region of the T1R receptor are modeled. This can be 
especially useful where either or both of these compounds change conformation upon binding. 
Knowledge of the structure of the ligand-binding domain (for example, residues 1-571 of SEQ 
ID NO:2) of the receptor also allows the design of high potency ligands and/or modulators. 

[0161] A template molecule is then selected onto which chemical groups that mimic the T1R 
modulator can be grafted. The template molecule and the chemical groups grafted onto it can 
conveniently be selected so that the mimetic is easy to synthesize, is pharmacologically 
acceptable, and does not degrade in vivo, while retaining the biological activity of the lead 
compound. Alternatively, where the mimetic is peptide-based, further stability can be achieved 
by cyclizing the peptide, thereby increasing its rigidity. The mimetic or mimetics found by this 
approach can then be screened by the methods of the present invention to see whether they have 
the ability to modulate the T1R receptor. Further optimization or modification can then be 
performed to arrive at one or more final mimetics for in vivo or clinical testing. 

Compositions of binding and/or modulating compounds 

[0162] Following identification of a compound that binds and/or modulates a T1R receptor, 
the compound may be manufactured and/or used in preparation of compositions including, but 
not limited to, foods, drinks, and pharmaceutical compositions. The compositions are provided or 
administered to patients, including, but not limited to, avians, felines, canines, bovines, ovines, 
porcines, equines, rodents, simians, and humans. 

[0163] Thus, the present invention extends, in various aspects, not only to compounds 
identified in accordance with the methods disclosed herein but also foods, drinks, pharmaceutical 
compositions, drugs, or other compositions comprising such a compound; methods comprising 
administration of such a composition to a patient, e.g. for treatment (which includes prophylactic 
treatment) of a T1R receptor-associated disorder (e.g., obesity, diabetes); uses of such a 
compound in the manufacture of a composition for administration to a patient; and methods of 
making a composition comprising admixing such a compound with a pharmaceutically 
acceptable excipient, vehicle or carrier, and optionally other ingredients. 

[0164] The compositions of the invention comprise a taste-modifying amount of at least one or 
more binding or modulating compounds. A "taste-modifying amount" is a quantity sufficient to 
increase or decrease the perception of a taste stimulus by a given mammal. The food and drink 
compositions of the invention are formulated by the addition of a binding or modulating 
compound to a food or drink of the mammal. Such compositions may be individualized or 
breed-specific. For example, feline veterinary specialty diets may thus be made more palatable. 

[0165] The pharmaceutical compositions of the invention comprise a therapeutically effective 
amount of a compound identified according to the methods disclosed herein, or a 
pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable carrier or excipient. 

[0166] The compounds of the invention can be formulated as neutral or salt forms. 
Pharmaceutically acceptable salts include those formed with free amino groups such as those 
derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., and those formed with 
free carboxyl groups such as those derived from sodium, potassium, ammonium, calcium, ferric 
hydroxides, isopropylamine, triethylamine, 2-ethylamino ethanol, histidine, procaine, etc. 

[0167] Pharmaceutically acceptable carriers include but are not limited to saline, buffered 
saline, dextrose, water, glycerol, ethanol, and combinations thereof. The carrier and composition 
can be sterile. The formulation should suit the mode of administration. 

[0168] The composition, if desired, can also contain minor amounts of wetting or emulsifying 
agents or pH buffering agents. The composition can be a liquid solution, suspension, emulsion, 
tablet, pill, capsule, sustained release formulation, or powder. The composition can be 
formulated as a suppository, with traditional binders and carriers such as triglycerides. Oral 
formulations can include standard carriers such as pharmaceutical grades of mannitol, lactose, 
starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, etc. 

[0169] The pharmaceutical compositions of the invention may further comprise a secondary 
compound for the treatment of a disorder unrelated to the T1R receptor, such as an antibiotic or 
other therapeutic agent, to improve the palatability of the pharmaceutical composition, thereby 
improving the ease of administration. 

[0170] In one embodiment, the composition is formulated in accordance with routine 
procedures as a pharmaceutical composition adapted for oral (e.g., tablets, granules, syrups) or 
non-oral (e.g., ointments, injections) administration to the subject. Various delivery systems are 
known and can be used to administer a compound that modulates a T1R receptor, e.g., 
encapsulation in liposomes, microparticles, microcapsules, expression by recombinant cells, 
receptor-mediated endocytosis, construction of a therapeutic nucleic acid as part of a retroviral or 
other vector, etc. Methods of introduction include but are not limited to intradermal, 
intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, topical, and oral routes. 

[0171] The compounds of the invention may be administered by any convenient route, for 
example by infusion or bolus injection, by absorption through epithelial or mucocutaneous 
linings (e.g., oral mucosa, rectal and intestinal mucosa, etc.), and may be administered together 
with other biologically active agents, for example in HAART therapy. Administration can be 
systemic or local. In addition, it may be desirable to introduce the pharmaceutical compositions 
of the invention into the central nervous system by any suitable route, including intraventricular 
and intrathecal injection; intraventricular injection may be facilitated by an intraventricular 
catheter, for example, attached to a reservoir, such as an Ommaya reservoir. 

[0172] In a specific embodiment, it may be desirable to administer the pharmaceutical 
compositions of the invention locally to the area in need of treatment; this may be achieved by, 
for example, and not by way of limitation, local infusion during surgery; topical application, e.g., 
in conjunction with a wound dressing after surgery; by injection; by means of a catheter; by 
means of a suppository; or by means of an implant, said implant being of a porous, non-porous, 
or gelatinous material, including membranes, such as silastic membranes, or fibers. 

[0173] The composition can be administered in unit dosage form and may be prepared by any 
of the methods well known in the pharmaceutical art, for example, as described in Remington's 
Pharmaceutical Sciences (Mack Publishing Co., Easton, PA). The amount of the compound 
of the invention that modulates a T1R receptor that is effective in the treatment of a particular 
disorder or condition will depend on factors including but not limited to the chemical 
characteristics of the compounds employed, the route of administration, the age, body weight, 
and symptoms of a patient, the nature of the disorder or condition, and can be determined by 
standard clinical techniques. Typically therapy is initiated at low levels of the compound and is 
increased until the desired therapeutic effect is achieved. In addition, in vitro assays may 
optionally be employed to help identify optimal dosage ranges. Suitable dosage ranges for 
intravenous administration are preferably generally about 20-500 micrograms of active 
compound per kilogram body weight. Suitable dosage ranges for intranasal administration are 
preferably generally about 0.01 pg/kg body weight to 1 mg/kg body weight. Suppositories 
preferably generally contain active ingredient in the range of 0.5% to 10% by weight; oral 
formulations preferably may contain 10% to 95% active ingredient. Effective doses may be 
extrapolated from dose-response curves derived from in vitro or animal model test systems. 
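
As a worked illustration of the arithmetic implied by the stated intravenous range (about 20-500 micrograms of active compound per kilogram body weight), the sketch below scales that range to a given body weight. The function name and the 4 kg example weight are assumptions made for illustration; the calculation shows only how the per-kilogram range translates to an absolute amount and is not a dosing recommendation.

```python
# Stated intravenous range from the paragraph above, in micrograms per kilogram.
IV_RANGE_UG_PER_KG = (20.0, 500.0)


def dose_range_ug(body_weight_kg: float, range_ug_per_kg=IV_RANGE_UG_PER_KG):
    """Scale a per-kilogram dose range to an absolute range in micrograms."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = range_ug_per_kg
    return low * body_weight_kg, high * body_weight_kg


# Hypothetical 4 kg animal (assumed weight, for illustration only).
low_ug, high_ug = dose_range_ug(4.0)
print(f"{low_ug:.0f} to {high_ug:.0f} micrograms")
```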

[0174] Typically, compositions for intravenous administration are solutions in sterile isotonic 
aqueous buffer. Where necessary, the composition may also include a solubilizing agent and a 
local anesthetic such as lidocaine to ease pain at the site of the injection. Generally, the 
ingredients are supplied either separately or mixed together in unit dosage form, for example, as 
a dry-lyophilized powder or water-free concentrate in a hermetically sealed container such as an 
ampoule or sachet indicating the quantity of active agent. Where the composition is to be 
administered by infusion, it can be dispensed with an infusion bottle containing sterile 
pharmaceutical grade water or saline. 

[0175] Where the composition is administered by injection, an ampoule of sterile water for 
injection or saline can be provided so that the ingredients may be mixed prior to administration. 

Treatment Methods 

[0176] The invention provides methods of treatment of T1R receptor-associated disorders by 
administering to a subject or patient an effective amount of a compound that modulates the T1R 
receptor. In some aspects of the invention, the compounds or pharmaceutical compositions of 
the invention are administered to a patient having an increased risk of or having a disorder 
associated with the T1R receptor. The patient may be, for example, avian, feline, canine, 
bovine, ovine, porcine, equine, rodent, simian, or human. 

Kits 

[0177] A kit of the invention comprises a carrier means being compartmentalized to receive in 
close confinement one or more container means such as vials, tubes, and the like, each of the 
container means comprising an element to be used in the methods of the invention. For example, 
one of the container means may comprise a polynucleotide encoding a T1R receptor of the 
invention, a T1R receptor of the invention, or an antibody thereto. The kit may also have one or 
more conventional kit components, including, but not limited to, instructions, test tubes, 
Eppendorf™ tubes, labels, reagents helpful for quantification of marker gene expression, etc. 

EXAMPLES 

[0178] The following examples are meant to be illustrative of the present invention and are not 
intended to limit the scope thereof. 

Cloning and Characterization of the Feline T1R3 receptor 

[0179] The discovery of feline taste receptor, T1R3, was achieved by using a molecular 
strategy termed "overgo" (Thomas, et al., Genome Res., 12:1277-1285 (2002); Vollrath, D., 
DNA markers for physical mapping, in GENOME ANALYSIS: A LABORATORY MANUAL, Vol. 4, ed. 
B. Birren, et al., pp. 187-215 (1999), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York). This strategy involves the use of the shortest DNA probes among the many kinds 
of probes used in bacterial artificial chromosome (BAC) library screening. These probes are 
comprised of two DNA sequences (e.g., 22mers or 24mers) with a complementary 8 base 
overlap. They can be designed by computer program (genome.wustl.edu/tools/?overgo=l) and 
are readily synthesized. 

[0180] Overgo probes were designed from conserved regions of the chromosome 1 marker, 
"disheveled 1" (DVL1) and the G protein-coupled receptor, T1R3, by aligning DVL1 and T1R3 
genomic sequences from many different species. The overlapping sequences of the seven DVL1 
overgo probes used in the present invention were as follows: 

[...] (SEQ ID NOS: 21-28) 

Ov10a ATCTACCACATGGACGAGGAGGAG (SEQ ID NO: 29) 
Ov10b TGACCAGGTACGGCGTCTCCTCCT (SEQ ID NO: 30) 

Ov11a AGCGCGTCACGCTGGCCGACTTCA (SEQ ID NO: 31) 
Ov11b TTGCTGAGCACGTTCTTGAAGTCG (SEQ ID NO: 32) 

Ov12a CACGCCTACAAATTCTTCTTTAAG (SEQ ID NO: 33) 
Ov12b AGTCCTGGTCCATGGACTTAAAGA (SEQ ID NO: 34) 

The overlapping sequences of the twelve T1R3 overgo probes used in the present invention were 
as follows: 

t1r3-OV1a CTTCCACTCCTGCTGCTACGACTG (SEQ ID NO: 35) 
t1r3-OV1b TGCCTCGCAGTCCACGCAGTCGTA (SEQ ID NO: 36) 

t1r3-OV2a AGGTGCGCCGCGTCAAGGGCTTCC (SEQ ID NO: 37) 
t1r3-OV2b TCGTAGCAGCAGGAGTGGAAGCCC (SEQ ID NO: 38) 

t1r3-OV3a GTTCCTGGCATGGGGGGAGCCGGC (SEQ ID NO: 39) 
t1r3-OV3b GAGCAGCACAAGCACAGCCGGCTC (SEQ ID NO: 40) 

t1r3-OV4a ACAGCCCACTAGTTCAGGCCGCAG (SEQ ID NO: 41) 
t1r3-OV4b CAGGCCCGGGGTCCCCCTGCGGCC (SEQ ID NO: 42) 

t1r3-OV5a CCCACTGGTTCAGGCCTCGGGGGG (SEQ ID NO: 43) 
t1r3-OV5b AAAGCAGGCCAGGGGCCCCCCCGA (SEQ ID NO: 44) 

t1r3-OV6a AGGCGCTGGTGCACTGCCGCACAC (SEQ ID NO: 45) 
t1r3-OV6b AAGCTGACCCAGGAGCGTGTGCGG (SEQ ID NO: 46) 

t1r3-OV7a ACAGAGGCACTGGTGCACTGCCGC (SEQ ID NO: 47) 
t1r3-OV7b TGATCCAGGAGTGCACGCGGCAGT (SEQ ID NO: 48) 

t1r3-OV8a ACCAATGCCACGCTGGCCTTTCTC (SEQ ID NO: 49) 
t1r3-OV8b AAGTGCCCAGGAAGCAGAGAAAGG (SEQ ID NO: 50) 

t1r3-OV9a TGGTACATGCTGCCAATGCCACGC (SEQ ID NO: 51) 
t1r3-OV9b AAGCAGAGGAAAGCCAGCGTGGCA (SEQ ID NO: 52) 

t1r3-OV10a TACAACCGTGCCCGTGGCCTCACC (SEQ ID NO: 53) 
t1r3-OV10b AGGCCAGCATGGCGAAGGTGAGGC (SEQ ID NO: 54) 

t1r3-OV11a TCATCACCTGGGTCTCCTTTGTGC (SEQ ID NO: 55) 
t1r3-OV11b ACATTGGCCAGGAGGGGCACAAAG (SEQ ID NO: 56) 

t1r3-OV12a TGCAGATGGGTGCCCTCCTGCTCT (SEQ ID NO: 57) 
t1r3-OV12b AGGATGCCCAGCACACAGAGCAGG (SEQ ID NO: 58). 

The single-stranded overhangs were filled in with 32P-labeled dATP and dCTP, and the overgo 
probes hybridized with BAC libraries. 
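
The pairing rule for overgo oligonucleotides (two short oligos whose 3' ends are complementary over 8 bases, with the overhangs then filled in) can be checked programmatically. The following is a minimal sketch; the function names are hypothetical, and the two sequences are the pair listed above as SEQ ID NO: 35 and SEQ ID NO: 36.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def has_overgo_overlap(oligo_a: str, oligo_b: str, overlap: int = 8) -> bool:
    """Check that the 3' ends of the two oligos are complementary over `overlap` bases."""
    return oligo_a.upper()[-overlap:] == reverse_complement(oligo_b[-overlap:])


def filled_in_probe(oligo_a: str, oligo_b: str, overlap: int = 8) -> str:
    """Top strand of the double-stranded probe once the overhangs are filled in."""
    if not has_overgo_overlap(oligo_a, oligo_b, overlap):
        raise ValueError("oligos do not share the expected 3' overlap")
    return oligo_a.upper() + reverse_complement(oligo_b)[overlap:]


# Probe pair listed above as SEQ ID NO: 35 and SEQ ID NO: 36.
ov_a = "CTTCCACTCCTGCTGCTACGACTG"
ov_b = "TGCCTCGCAGTCCACGCAGTCGTA"
probe = filled_in_probe(ov_a, ov_b)
print(has_overgo_overlap(ov_a, ov_b), len(probe))
```

For two 24mers sharing an 8-base overlap, the filled-in probe is 24 + 24 - 8 = 40 bases long.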

[0181] The overgo strategy is considered to be more versatile than a PCR-based strategy by 
those skilled in the art of comparative physical mapping for the following reasons: (1) overgo 
probes are short (e.g., 36mers or 40mers), making the probability of good alignment from among 
many species more favorable; (2) overgo probes are more specific to the target genes compared 
with traditional cDNA and genomic DNA probes used by PCR; and (3) although overgo probes 
are short, they are not as restricted as traditional PCR probes, which cannot tolerate even a few 
mismatches, because they can be used in hybridization approaches with BACs or other libraries. 

[0182] Screening a feline genomic BAC library. Seven DVL1 overgo probes (SEQ ID 
NOS:21-34) were used in screening a feline genomic BAC library. Probes were radioactively 
labeled by the random hexa-nucleotide method (Feinberg & Vogelstein, Analytical Biochemistry, 
132:6-13 (1983)). Hybridization and washing of membranes followed standard protocols 
(Church & Gilbert, PNAS U.S.A., 81:1991-1995 (1984)). Thirty-nine positive BAC clones were 
identified. Several BAC ends were sequenced. One clone containing homologous sequence to 
human chromosome 1p36, BAC 552J19, was identified using bioinformatics tools. 

[0183] Production of a shotgun library for BAC 552J19 and identification of a single 
clone containing feline T1R3. BAC DNA from 552J19 was prepared by using Qiagen Large 
Construct Kit. DNA was then digested by the restriction enzyme Sau3AI and subcloned into 
pGEM+3Z (Promega) vector. After transformants were arrayed to a nylon membrane, two 
separate hybridizations were performed using seven DVL1 and twelve T1R3 overgo probes 
(SEQ ID NOS:35-58). Two clones positive for DVL1 and four clones positive for T1R3 were 
found. These clones were confirmed by sequencing. Because DVL1 is the neighboring gene of 
T1R3 in human and mouse, it is likely this also is the case in cat; therefore, the DVL1 positive 
clones verified that the BAC 552J19 is the correct BAC, that is, it is the one containing feline 
T1R3. 

Results 

[0184] More than 3 kb of genomic sequences containing the open reading frame for domestic 
cat taste receptor, T1R3, were obtained. Figure 1 shows the multiple sequence alignments of the 
known nucleotide sequences for the T1R receptors human (T1R1, SEQ ID NO:8; T1R2, SEQ ID 
NO:5; T1R3, SEQ ID NO: 11), mouse (T1R1, SEQ ID NO:6; T1R2, SEQ ID NO:3; T1R3, SEQ 
ID NO:9), and rat (T1R1, SEQ ID NO:7; T1R2, SEQ ID NO:4; T1R3, SEQ ID NO:10), along 
with the newly discovered and novel nucleotide sequence for the T1R3 taste receptor from 
domestic cat (SEQ ID NO:1). 

[0185] Figure 2 shows the deduced amino acid sequence of the domestic cat taste receptor, 
T1R3 (SEQ ID NO:2), aligned with the amino acid sequences of the T1R receptor family human 
(T1R1, SEQ ID NO:17; T1R2, SEQ ID NO:20; T1R3, SEQ ID NO:12), rat (T1R1, SEQ ID 
NO:16; T1R2, SEQ ID NO:19; T1R3, SEQ ID NO:14), and mouse (T1R1, SEQ ID NO:15; 
T1R2, SEQ ID NO:18; T1R3, SEQ ID NO:13). The deduced cat sequence predicts four 
additional amino acids at positions 11-14 relative to the homologous T1R3 receptors of mouse, 
human, and rat. The deduced sequence for cat reveals a threonine in position 64, a position 
equivalent to amino acid 60 in mouse, and a leucine at position 59, a position equivalent to 
position 55 in mouse. In mouse, amino acid substitutions of a threonine at position 60 and an 
alanine at position 55, both positions located within the putative extracellular N-terminal domain 
of the polypeptide, are present in strains of mice demonstrating low preference for the sweet 
stimulus saccharin (Bachmanov et al., Chem. Senses, 26:925-933 (2001)). Leucine is a 
conservative substitution for alanine. Accordingly, the amino acid sequence differences of cat 
and mouse T1R3 receptor may account for functional differences that lead to different taste 
preferences between the two species. For example, the amino acid substitutions may explain the 
cat's inability to taste many compounds that have a sweet taste to mice and humans. 

[0186] The rat and mouse have closely related T1R receptors, while the T1R3 of human and 
cat diverge from these two, as illustrated in the phylogenetic tree of Figure 3. Interestingly, the 
types of sweet compounds to which the rat and mouse respond are very similar, whereas those 
that stimulate the human and those that stimulate the cat differ markedly from those for rat 
and mouse and also differ markedly from each other. 

[0187] The feline T1R3 receptor is a seven transmembrane receptor similar in structure to 
other known members of the T1R family of receptors (Figure 4). The structure of the feline 
T1R3 receptor was generated through use of a protein modeling program available at 
<www.ebi.ac.uk/~moeller/transmembrane.html>. 

Cloning and characterization of the feline T1R1 and T1R2 receptors 

Elucidation of the cat T1R1 and cat T1R2 receptors also was accomplished using an 
overgo strategy. Overgo probes from conserved coding regions were designed by aligning T1R1 
and T1R2 sequences from many different species, including human, mouse, rat, cow, and pig. 
The single-stranded overhangs (14 bases) were filled in with 32P-labeled dATP and dCTP, and 
the overgo probes hybridized with BAC libraries. The overlapping sequences of the six cat 
T1R1 overgo probes were as follows: 

t1r1_1-OVa TAAACAACTCCACGGCCCTGCTGC (SEQ ID NO: 65) 
t1r1_1-OVb CCCAGGGTGATGTTGGGCAGCAGG (SEQ ID NO: 66) 

t1r1_2-OVa GCTGTGTATGCGGTGGCCCATGGC (SEQ ID NO: 67) 
t1r1_2-OVb CCAGGAGCTGGTGGAGGCCATGGG (SEQ ID NO: 68) 

GCCACTGGATTTTGGTCTCATTTA ( 
AGCTAACACGCTGCTGCTGCTGCT ( 

The overlapping sequences of the seven cat T1R2 overgo probes were as follows: 

[0188] Screening a feline genomic BAC library. The T1R1 and T1R2 overgo probes were 
used to screen a feline genomic BAC library. Probes were radioactively labeled by the random 
hexa-nucleotide method (Feinberg & Vogelstein, Analytical Biochemistry, 132(1):6-13 (1983)). 
Hybridization and washing of membranes followed standard protocols (Church & Gilbert, PNAS, 
81:1991-1995 (1984)). Six positive BAC clones for cat T1R1 and eight positive BAC clones for 
cat T1R2 were identified. 



[0189] Production of shotgun libraries for BACs containing cat T1R1 and T1R2, and 
identification of some small insert clones containing feline T1R1 and T1R2. Two BACs 
(150M6 and 233G22) containing cat T1R1 and three BACs (93C1, 240H9 and 400B1) 
containing cat T1R2 were used to prepare BAC DNAs using Qiagen Large Construct Kit. The 
BAC DNAs were digested using the enzyme Sau3AI and the digested BAC DNA fragments 
subcloned into pGEM+3Z (Promega) vector. After transformants were arrayed to a nylon 
membrane, two separate hybridizations were performed using pooled six T1R1 and seven T1R2 
overgo probes. By sequencing positive clones from shotgun libraries and by using a chromosome 
walking strategy, the full coding region of the cat T1R1 and exon 3 to exon 6 of cat T1R2 were 
obtained. 

[0190] Elucidation of exon 1 and exon 2 of the cat T1R2 by PCR strategy. Since exon 1 
and exon 2 of the cat T1R2 were not present in the three BACs selected above, PCR was 
performed using degenerate primers designed from T1R2 alignments from different species 
(human, rodents and dog) and cat genomic DNA as template. 

Degenerate primers for cat T1R2 exon 1 and exon 2: 

Primer   Sequence                       SEQ ID NO        PCR product size 
Dex1f1:  5' TCRGACTTCTACCTGCCTGGRGA 3'  (SEQ ID NO:91)   85bp 
Dex1r1:  5' CTTCACGTTGGCATGGAGGG 3'     (SEQ ID NO:92) 
Dex1f2:  5' TACCTCCTGGGTGGCCTCTTC 3'    (SEQ ID NO:93)   66bp 
Dex1r2:  5' TCTTGCACwkGGGCACCTGC 3'     (SEQ ID NO:94) 
Dex2f1:  5' AGGTGtTGGGCTACAACCTsAT 3'   (SEQ ID NO:95)   206bp 
Dex2r1:  5' GGGCAkGTAGTGGCTGTAGTC 3'    (SEQ ID NO:96) 
Dex2f2:  5' GGCTACAACCTsATGCAGGCCA 3'   (SEQ ID NO:97)   220bp 
Dex2r2:  5' GAGTTGTCAGGGCCAATGACCG 3'   (SEQ ID NO:98). 

The PCR products were confirmed by sequencing. The feline BAC library was then re-screened 
using PCR products and four new BACs were retrieved (40545, 2J533, 4F220 and 24D448). 
Using a chromosome walking strategy, the complete sequences of exon 1 and exon 2 were 
obtained from these four BAC clones. 
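
A degenerate primer stands for a pool of concrete sequences, one for each combination of bases allowed at its ambiguous positions. The sketch below enumerates that pool using the standard IUPAC nucleotide ambiguity codes (for example, R = A or G, W = A or T, K = G or T, S = C or G). The function name is hypothetical, and treating lowercase letters in the listed primers as ambiguity codes is an assumption.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(primer: str) -> list[str]:
    """List every concrete sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Forward exon 1 primer listed above as SEQ ID NO:91 (two R positions).
variants = expand_degenerate("TCRGACTTCTACCTGCCTGGRGA")
for variant in variants:
    print(variant)
```

Each two-fold ambiguous position doubles the pool, so a primer with two such positions represents four sequences.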

Results 

[0191] Approximately 10 kb of genomic sequence containing the open reading frame for cat 
T1R1 and approximately 38 kb of genomic sequence containing the open reading frame for cat 
T1R2 were obtained. Figure 1 shows the multiple sequence alignments of the known nucleotide 
sequences for the T1R receptors from human, mouse and rat, along with the nucleotide 
sequences for the cat T1R1, T1R2, and T1R3 taste receptors. The sequences of cat T1Rs are 
highlighted. 

[0192] Figure 2 shows the deduced amino acid sequence of the domestic cat taste receptors, 
T1R1, T1R2, and T1R3 aligned with the amino acid sequences of the T1R receptor family from 
human, rat, and mouse. The cat T1R1 is very similar to human and rodents in terms of gene 
structure; however, cat T1R2 predicts a shorter protein of 391 amino acids compared with the 
human T1R2, which has 839 amino acids. This prediction of a short T1R2 is the result of a stop 
codon TAA in exon 3. 
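
The effect of a premature stop codon on predicted protein length can be illustrated by scanning a coding sequence codon by codon. This is a minimal sketch that assumes the input is a spliced coding sequence already in frame from the start codon; the function name is hypothetical and the short sequence is a toy example, not the cat T1R2 sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_in_frame_stop(coding_sequence: str):
    """Return (codon_index, codon) for the first in-frame stop codon, or None.

    The codon index is zero-based, so it also equals the number of amino
    acids encoded before translation terminates.
    """
    seq = coding_sequence.upper()
    for codon_index in range(len(seq) // 3):
        codon = seq[3 * codon_index: 3 * codon_index + 3]
        if codon in STOP_CODONS:
            return codon_index, codon
    return None


# Toy sequence (assumed): ATG GCT TGC TAA GGC TGA
toy_cds = "ATGGCTTGCTAAGGCTGA"
result = first_in_frame_stop(toy_cds)
print(result)
```

In the toy sequence, the TAA codon ends translation after three residues even though further codons follow, which is the same kind of truncation described for cat T1R2.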

[0193] Table 4 shows the percent homology among the members of the T1R family in relation 
to the cat T1R taste receptors. The portion of Table 4 to the left of the diagonal (in bold type) 
shows the percent homology based on the open reading frame of the nucleotide sequences 
obtained from Figure 1 for the T1R family among human, cat, rat and mouse. The upper portion 
to the right of the diagonal (in italic type) shows the percent homology of the T1R members 
based on the amino acid sequences of Figure 2. Cat T1R1 shows 84% nucleotide sequence 
homology with human T1R1, 78% with rat T1R1 and 79% with mouse T1R1. At the amino acid 
level, cat T1R1 shows 81% homology with human T1R1, 74% with rat, and 74% with mouse. 
Cat T1R1 shows generally low homology with the other known members of the T1R family, 
T1R2 and T1R3, from human, rat and mouse. The same range of relatively low homology is 
present among the human, rat and mouse T1R1, T1R2 and T1R3 receptors from the same 
species. Cat T1R2 shows 72% nucleotide sequence homology with human T1R2, 61% with rat 
T1R2 and 64% with mouse T1R2. At the amino acid level, cat T1R2 shows 58% homology with 
human T1R2, 52% with rat, and 53% with mouse. Since cat T1R2 has a shorter protein (391aa) 
due to a stop codon in exon 3, cat T1R2 shows much lower homology with T1R2 in other 
species than the homology for T1R1 and T1R3 among different species, which indicates that cat 
T1R2 is very different from that of the other species. This is also consistent with the behavioral 
responses showing that cats do not show preference for carbohydrate sweeteners. This indicates 
that cat T1R2 may not be functional, freeing it from selective pressure. Therefore mutations in 
cat T1R2 most likely have accumulated. Cat T1R2 shows generally low homology with the other 
members of the T1R family, T1R1 and T1R3, from human, rat and mouse. The same range of 
relatively low homology is present among the human, rat, and mouse T1R2 and the T1R1 and 
T1R3 receptors from the same species. 

Table 4. Percent Homology Among Diverse Species for T1Rs

Species      Mouse  Mouse  Mouse  Rat    Rat    Rat    Human  Human  Human  Cat    Cat    Cat
             T1R1   T1R2   T1R3   T1R1   T1R2   T1R3   T1R1   T1R2   T1R3   T1R1   T1R2   T1R3
Mouse T1R1   --     36     30     90     36     30     73     37     30     74     30     30
Mouse T1R2   55     --     28     36     91     28     34     69     28     36     53     28
Mouse T1R3   33     15     --     31     28     92     30     27     72     30     25     72
Rat T1R1     91     55     33     --     37     31     73     37     31     74     26     31
Rat T1R2     55     91     15     57     --     28     34     71     29     36     52     28
Rat T1R3     33     21     93     32     15     --     31     27     73     30     26     72
Human T1R1   79     56     35     79     56     35     --     35     31     81     29     31
Human T1R2   57     78     17     56     78     17     57     --     28     36     58     28
Human T1R3   41     39     73     39     36     75     40     38     --     29     23     73
Cat T1R1     79     54     35     78     56     35     84     56     53     --     28     30
Cat T1R2     42     64     22     41     61     22     44     72     48     44     --     29
Cat T1R3     33     34     74     36     36     75     53     39     79     53     39     --

Note: Upper right cells (italics) contain deduced amino acid homology; lower left cells (bold) contain 
nucleotide homology. 
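
One way a percent value of the kind reported in Table 4 could be derived from a pairwise alignment is sketched below. This is an illustrative assumption only: the text does not state how the homology values were computed, and the function name `percent_identity`, the gap-handling rule, and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical positions between two aligned sequences.

    Columns in which either sequence has a gap are skipped, which is one
    common convention; other conventions count gaps as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # ignore gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy aligned pair (hypothetical): five ungapped columns, four of them identical.
print(percent_identity("ATGC-A", "ATGGTA"))
```

The same function works for aligned amino acid sequences, since it only compares characters column by column.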

What is Claimed:

1. An isolated and purified polynucleotide encoding a T1R receptor comprising:

a) the nucleotide sequence of SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ
ID NO:62, or SEQ ID NO:63,

b) a fragment of at least about 42 contiguous nucleotides of SEQ ID NO:1 encoding
a polypeptide having substantially the same biological activity as a polypeptide encoded by the
nucleotide sequence of SEQ ID NO:1,

c) a fragment of at least about 42 contiguous nucleotides of SEQ ID NO:59 or SEQ
ID NO:60 encoding a polypeptide having substantially the same biological activity as a
polypeptide encoded by the nucleotide sequence of SEQ ID NO:59 or SEQ ID NO:60,
respectively,

d) a fragment of at least about 42 contiguous nucleotides of SEQ ID NO:62 or SEQ
ID NO:63 encoding a polypeptide having substantially the same biological activity as a
polypeptide encoded by the nucleotide sequence of SEQ ID NO:62 or SEQ ID NO:63,
respectively,

e) a variant of the polynucleotide of SEQ ID NO:1 having at least 80% homology to
the polynucleotide of SEQ ID NO:1 and encoding a polypeptide having substantially the same
biological activity as a polypeptide encoded by the nucleotide sequence of SEQ ID NO:1,

f) a variant of the polynucleotide of SEQ ID NO:59 or SEQ ID NO:60 having at
least 85% homology to the polynucleotide of SEQ ID NO:59 or SEQ ID NO:60 and encoding a
polypeptide having substantially the same biological activity as a polypeptide encoded by the
nucleotide sequence of SEQ ID NO:59 or SEQ ID NO:60, respectively,

g) a variant of the polynucleotide of SEQ ID NO:62 or SEQ ID NO:63 having at
least 75% homology to the polynucleotide of SEQ ID NO:62 or SEQ ID NO:63 and encoding a
polypeptide having substantially the same biological activity as a polypeptide encoded by the
nucleotide sequence of SEQ ID NO:62 or SEQ ID NO:63, respectively,

h) a variant of the polynucleotide of SEQ ID NO:1 having at least 80% homology to
the polynucleotide of SEQ ID NO:1 and encoding a polypeptide conferring modified taste
perception to one or more taste stimuli relative to a polypeptide encoded by the polynucleotide of
SEQ ID NO:1,

i) a nucleotide sequence encoding the amino acid sequence of SEQ ID NO:2, SEQ
ID NO:61, or SEQ ID NO:64,

j) a nucleotide sequence substantially complementary to the nucleotide sequence of
SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63, or

k) a nucleotide sequence that hybridizes to the complement of the polynucleotide
having SEQ ID NO:1, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:62, or SEQ ID NO:63 under
high stringency conditions.

2. The polynucleotide of claim 1, wherein said polynucleotide is DNA. 

3. The polynucleotide of claim 1, wherein said polynucleotide is RNA. 

4. The polynucleotide of claim 1 comprising a variant of the polynucleotide of SEQ ID
NO:1 encoding an amino acid sequence of SEQ ID NO:2 having a nonconserved amino acid
substitution at residue 59 or residue 64.

5. The polynucleotide of claim 1 comprising a fragment of the polynucleotide of SEQ ID
NO:1, wherein said fragment comprises a nucleotide sequence encoding an extracellular domain
of the polypeptide of SEQ ID NO:2, a transmembrane domain of the polypeptide of SEQ ID
NO:2, or an intracellular domain of the polypeptide of SEQ ID NO:2.

6. The polynucleotide of claim 1 comprising a fragment of the polynucleotide of SEQ ID
NO:59 or SEQ ID NO:60, wherein said fragment comprises a nucleotide sequence encoding an
extracellular domain of the polypeptide of SEQ ID NO:61, a transmembrane domain of the
polypeptide of SEQ ID NO:61, or an intracellular domain of the polypeptide of SEQ ID NO:61.

7. The polynucleotide of claim 1 comprising a fragment of the polynucleotide of SEQ ID 
NO:62 or SEQ ID NO:63, wherein said fragment comprises a nucleotide sequence encoding an 
extracellular domain of the polypeptide of SEQ ID NO:64, a transmembrane domain of the 
polypeptide of SEQ ID NO:64, or an intracellular domain of the polypeptide of SEQ ID NO:64. 

8. An expression vector comprising the polynucleotide of claim 1 operably linked to a 
promoter.

9. A host cell comprising the expression vector of claim 8.

10. The host cell of claim 9 wherein said cell is mammalian.

11. The host cell of claim 10 wherein said cell is a human, murine, or feline cell.

12. A cell culture comprising at least one cell of claim 8. 

13. An isolated and purified T1R receptor polypeptide encoded by a polynucleotide of claim 
1. 

14. The polypeptide of claim 13 wherein said polypeptide comprises the amino acid sequence 
of SEQ ID NO:2, a fragment of at least 30 contiguous amino acids of SEQ ID NO:2, or a variant 
thereof having substantially the same biological activity as the polypeptide of SEQ ID NO:2.

15. The polypeptide of claim 13 wherein said polypeptide comprises an amino acid sequence 
having at least one sequence variation of SEQ ID NO:2 wherein said variation confers modified 
taste perception to one or more taste stimuli relative to a polypeptide of SEQ ID NO:2. 

16. The polypeptide of claim 13 wherein said polypeptide comprises the amino acid sequence 
of SEQ ID NO:61, a fragment of at least 40 contiguous amino acids of SEQ ID NO:61, or a
variant thereof having substantially the same biological activity as the polypeptide of SEQ ID 
NO:61. 

17. The polypeptide of claim 13 wherein said polypeptide comprises the amino acid sequence 
of SEQ ID NO:64, a fragment of at least 20 contiguous amino acids of SEQ ID NO:64, or a 
variant thereof having substantially the same biological activity as the polypeptide of SEQ ID 
NO:64. 

18. An isolated and purified T1R3 receptor polypeptide comprising the amino acid sequence 
of SEQ ID NO:2.

19. An isolated and purified T1R2 receptor polypeptide comprising the amino acid sequence 
of SEQ ID NO:64. 

20. An isolated and purified T1R1 receptor polypeptide comprising the amino acid sequence 
of SEQ ID NO:61. 

21. The polypeptide of claim 14, wherein said polypeptide comprises a feline T1R3 receptor.

22. The polypeptide of claim 16, wherein said polypeptide comprises a feline T1R1 receptor.

23. The polypeptide of claim 17, wherein said polypeptide comprises a feline T1R2 receptor. 

24. A kit for the detection of a polynucleotide encoding a feline T1R receptor comprising a 
polynucleotide that specifically hybridizes to a polynucleotide encoding a polypeptide having an 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:61, or SEQ ID NO:64 and instructions 
relating to detection of said polynucleotide. 

25. A method of producing a feline T1R receptor comprising culturing the host cell of claim 
9 and recovering said receptor from said host cell. 

26. The feline T1R receptor produced according to the method of claim 25. 

27. A method for identifying compounds that interact with a feline T1R receptor comprising: 

contacting a feline T1R receptor of claim 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23 with 
a test compound, and 

detecting interaction between said receptor and said compound. 

28. The method of claim 27, wherein said receptor is bound to a solid support. 

29. The method of claim 28, wherein said solid support is formulated into a feline-specific 
electronic tongue. 

30. The method of claim 27 wherein said step of contacting said T1R receptor with said test 
compound occurs in the presence of a heterodimerization partner of said T1R receptor. 

31. A method for identifying an agonist of a feline T1R receptor comprising:

expressing a polynucleotide of claim 1 in the presence of a test compound, and 

detecting an increase in biological activity of a polypeptide produced by said expression 
step in the presence of said compound relative to biological activity of said polypeptide in the 
absence of said compound. 

32. The method of claim 31 wherein said polynucleotide is expressed in the presence of the 
heterodimerization partner of said T1R receptor.

33. A method for identifying an agonist of a feline T1R receptor comprising:

contacting a polypeptide of claim 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23 with a test
compound, and 

detecting an increase in biological activity of said polypeptide in the presence of said 
compound relative to biological activity of said polypeptide in the absence of said compound. 

34. The method of claim 33 wherein said contacting step occurs in the presence of a 
heterodimerization partner of said polypeptide. 

35. A method for identifying an antagonist of a feline T1R receptor comprising: 

expressing a polynucleotide of claim 1 in the presence of a test compound, and 

detecting a decrease in biological activity of a polypeptide produced by said expression 
step in the presence of said compound relative to biological activity of said polypeptide in the 
absence of said compound. 

36. The method of claim 35 wherein said expressing step occurs in the presence of a 
heterodimerization partner of said T1R receptor. 

37. A method for identifying an antagonist of a feline T1R receptor comprising: 

contacting the polypeptide of claim 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23 with a test 
compound, and 

detecting a decrease in biological activity of said polypeptide in the presence of said 
compound relative to biological activity of said polypeptide in the absence of said compound. 

38. The method of claim 37 wherein said contacting step occurs in the presence of a 
heterodimerization partner of said T1R receptor. 

39. The method of claim 33 wherein said polypeptide is bound to a solid support. 

40. The method of claim 39 wherein said solid support is formulated into a feline-specific 
electronic tongue. 

41. The method of claim 37 wherein said polypeptide is bound to a solid support.

42. The method of claim 41 wherein said solid support is formulated into a feline-specific 
electronic tongue.

43. A method of identifying a feline T1R3 receptor variant that confers modified taste
perception comprising expressing a variant of the polynucleotide of SEQ ID NO:1 having at least
80% homology to the polynucleotide of SEQ ID NO:1 and detecting an increase or a decrease in
the biological activity of the polypeptide encoded by the variant relative to the biological activity
of the polypeptide encoded by SEQ ID NO:1.

44. A method of identifying a feline T1R2 receptor variant that confers modified taste
perception comprising expressing a variant of the polynucleotide of SEQ ID NO:62 or SEQ ID
NO:63 having at least 75% homology to the polynucleotide of SEQ ID NO:62 or SEQ ID NO:63
and detecting an increase or a decrease in the biological activity of the polypeptide encoded by
the variant relative to the biological activity of the polypeptide encoded by SEQ ID NO:62 or
SEQ ID NO:63.

45. A method of identifying a feline T1R1 receptor variant that confers modified taste
perception comprising expressing a variant of the polynucleotide of SEQ ID NO:59 or SEQ ID
NO:60 having at least 85% homology to the polynucleotide of SEQ ID NO:59 or SEQ ID NO:60
and detecting an increase or a decrease in the biological activity of the polypeptide encoded by
the variant relative to the biological activity of the polypeptide encoded by SEQ ID NO:59 or
SEQ ID NO:60.

46. The host cell of claim 9 wherein said cell is a bacterial cell. 

47. A T1R receptor comprising at least one extracellular domain of a feline T1R receptor. 

48. The receptor of claim 47 wherein said extracellular domain comprises: 

a) amino acids 1-563, amino acids 624-635, amino acids 701-726, or amino acids 
781-792 of SEQ ID NO:61, 

b) amino acids 1-147 of SEQ ID NO:64, or 

c) amino acids 1-571, amino acids 628-641, amino acids 705-731, or amino acids 
787-794 of SEQ ID NO:2. 

49. A T1R receptor comprising at least one transmembrane domain of a feline T1R receptor. 

50. The receptor of claim 49 wherein said transmembrane domain comprises:

a) amino acids 564-589, amino acids 604-623, amino acids 636-660, amino acids
681-700, amino acids 727-748, amino acids 761-780, or amino acids 793-817 of SEQ ID NO:61,

b) amino acids 148-167 of SEQ ID NO:64, or

c) amino acids 1-572, amino acids 610-627, amino acids 642-664, amino acids
681-704, amino acids 731-754, amino acids 767-786, or amino acids 795-812 of SEQ ID NO:2.

51. A T1R receptor comprising an intracellular domain of a feline T1R receptor.

52. The T1R receptor of claim 51 wherein said intracellular domain comprises:

a) amino acids 590-603, amino acids 661-680, amino acids 749-760, or amino acids 
818-841 of SEQ ID NO:61, 

b) amino acids 168-391 of SEQ ID NO:64, or 

c) amino acids 595-609, amino acids 665-680, amino acids 755-766, or amino acids 
813-865 of SEQ ID NO:2. 

53. The T1R receptor of any one of claims 47-52, wherein said receptor is a chimeric 
receptor. 

54. A polynucleotide encoding the T1R receptor of any one of claims 47-52.

ABSTRACT

The present invention relates to the discovery of several genes of the domestic cat (Felis
catus) associated with taste perception. The invention provides, inter alia, the nucleotide
sequence of the feline T1R1, T1R2, and T1R3 receptor genes, the amino acid sequences of the 
polypeptides encoded thereby, and antibodies to the polypeptides. The present invention also 
relates to methods for screening for compounds that modify the genes' function or activity, the 
compounds identified by such screens, and mimetics of the identified compounds. The invention 
further provides methods for modifying the taste preferences, ingestive responses, or general 
behavior of a mammal by administering compounds that affect the function or activity of the 
gene or the polypeptide encoded thereby. 

Figure 1A. CLUSTAL W (1.82) multiple nucleotide sequence alignment of T1Rs

mouseTas1r2     ATGGGACCCCAGGCGAG------GACACTCCATTTGCTGTTTCTCCTGCTGCATGCTCTG 54
ratTas1r2       ATGGGTCCCCAGGCAAG------GACACTCTGCTTGCTGTCTCTCCTGCTGCATGTTCTG 54
humanTAS1R2     ATGGGGCCCAGGGCAAA------GACCATCTGCTCCCTGTTCTTCCTCCTATGGGTCCTG 54
catTas1r2       ATGGGACCCCGGGCCAG------GGAAGTCTGCTGCTTCATCATCCTGCCGCGGCTCCTG 54
mouseTas1r1     ATGCTTTTCTGGGCAGCTCACCTGCTGCTCAGCCTGCAGCTGGCCGTTGCTTACTGCTGG 60
ratTas1r1       ATGCTCTTCTGGGCTGCTCACCTGCTGCTCAGCCTGCAGTTGGTC------TACTGCTGG 54
humanTAS1R1     ATGCTGCTCTGCACGGCTCGCCTGGT---CGGCCTGCAGCTTCTCATTTCCTGCTGCTGG 57
catTas1r1       ATGTCACTCCCGGCGGCTCACCTGGT---CGGCCTGCAGCTCTCCCTCTCCTGCTGCTGG 57
mouseTas1r3     ATGCCAGCTTTGGCTAT---CATGGGTCTCA------------GCCTGGCTGCTTTCCTG 45
ratTas1r3       ATGCCGGGTTTGGCTAT---CTTGGGCCTCA------------GTCTGGCTGCTTTCCTG 45
catTas1r3       ATGCCCGGCCTCGCTCT---CCTGGGCCTCACGGCTCTCCTGGGCCTCACGGCTCTCTTG 57
humanTAS1R3     ATGCTGGGCCCTGCTGT---CCTGGGCCTCA------------GCCTCTGGGCTCTCCTG 45



mouseTaslr2 C — CT AAGCC AGTCATGCTGGT AGGGAAC -TC CGACTTTCACCTGGCTGGGGACTAC 108 

ratTaslr2 C — CTAAGCCAGGCAAGCTGGTAGAGAAC-TC T G AC TTCCACCTGGC CG G GG AC T AC 108 

humanTASlR2 . G — CTGAGCC GGC T GAG AAC - T C GGACTTCTACCTGCCTGGGGATTAC 99 

catTaslr2 G — CTGAGCC GGCTGAGAAC-TC AGACTTCTACTTGGCTGGGGATTAC 99 

mouseTaslrl G — CTTTCAGCTGCCAAAGGACAGAATCC-TCTCCAGGTTTCAGCCTCCCTGGGGACTTC 117 

ratTaslrl G — CTTTCAGCTGCCAAAGGACAGAGTCC-TCTCCAGGCTTCAGCCTTCCTGGGGACTTC 111 

humanTASlRl G--CCTTTGCCTGCCATAGCACGGAGTCT-TCTCCTGACTTCACCCTCCCCGGAGATTAC 114 

catTaslrl G— CTCTCAGCTGCCACAGCACAGAGACG-TCTGCCGACTTCAGCCTCCCTGGGGATTAC 114 

mouseTaslr3 GAGCTTGGGATGGGGGCCTCTTTGTGTCTGTCACAGCAATTCAAGGCACAAGGGGACTAC 105 

ratTaslr3 GAGCTTGGGATGGGGTCCTCTTTGTGTCTGTCACAGCAATTCAAGGCACAAGGGGACTAT 105 

catTaslr3 GACCACGGGGAGGGCGCAACGTCCTGCTTGTCACAGCAGCTCAGGATGCAGGGGGACTAT 117 

humanTASlR3 CACCCTGGGACGGGGGCCCCATTGTGCCTGTCACAGCAACTTAGGATGAAGGGGGACTAC 105 
* * * * ***** 

mouseTaslr2 CTCCTGGGTGGCCTCTTTACCCTCCATGCCAACGTGAAGAGCGTCTCTCACCTCAGCTAC 168 

ratTaslr2 CTCCTGGGTGGCCTCTTTACCCTCCATGCCAACGTGAAGAGCATCTCCCACCTCAGCTAC 1 68 

humanTAS 1 R2 CTCCTGGGTGGCCTCTTCTCCCTCCATGCCAACATGAAGGGCATTGTTCACCTT AACTTC 159 

catTaslr2 TTCCTCGGCGGCCTCTTCACCCTCCATGCCAACGTGAAGGGCATCGTCCACCTCAACCTC 159 

mouseTaslrl CTCCTGGCAGGCCTGTTCTCCCTCCATGCTGACTGTCTGCAGGTGAGACACA — GACCTC 175 

ratTaslrl CTCCTTGCAGGTCTGTTCTCCCTCCATGGTGACTGTCTGCAGGTGAGACACA — GACCTC 169 

humanTASlRl CTCCTGGCAGGCCTGTTCCCTCTCCATTCTGGCTGTCTGCAGGTGAGGCACA — GACCCG 172 

catTas 1 r 1 CTCCTCGCAGGTCTGTTCCCTCTGCACTCTGACTGTCCGGGCGTGAGGCACC — GGCCCA 172 

mouseTaslr3 ATACTGGGCGGGCTATTTCCCCTGGGCTCAACCGAGGAGGCCACTCTCAACCAGAGAACA 165 

ratTaslr3 ATATTGGGTGGACTATTTCCCCTGGGCACAACTGAGGAGGCCACTCTCAACCAGAGAACA 165 

catTas lr3 GTGCTGGGTGGGCTCTTCCCTCTGGGCTCTGCCGAGGGTACAGGTCTTGGCGACGGGCTG 177 

humanTASlR3 GTGCTGGGGGGGCTGTTCCCCCTGGGCGAGGCCGAGGAGGCTGGCCTCCGCAGCCGGACA 1 65 
* * * ** ** ** * ** * 

mouseTaslr2 CTGCAGGTGCCCAAGTGCAATGAGTACAACA TGAAGGTCTTGGGCTACAACCTCATG 225 

r a t Ta s 1 r 2 CTGCAGGTGCCCAAGTGCAATGAGTTCACCA — - TGAAGGTGTTGGGCT AC AACCTCATG 225 

humanTASlR2 ; CTGC AGGTGCCCATGTGC AAGGAGTATG AAG - - - TG AAGGTGAT AGGCT AC AAC CTC ATG 216 

catTaslr2 CTGCAGGTGCCCCAGTGC AAGGAGTATG AAA TAAAGGTGTTGGGCTACGATCTCATG 216 

mouseTaslrl T GGTGACAAGTTGTGACAGGTCTGACAGCTTCAACGGCCATGGCTATCACCTCTTG 231 

ratTaslrl T GGTGACAAGTTGTGACAGGCCCGACAGCTTCAACGGCCATGGCTACCACCTCTTC 225 

humanTAS 1 Rl A— — GGTGACCCTGTGTGACAGGTCTTGTAGCTTCAATGAGCATGGCTACC ACCT CTTC 228 

catTaslrl C GGTGACCCTCTGTGACAGGCCCGACAGCTTCAACGGTCACGGCTACCACCTCTTC 228 

mouseTaslr3 C AACCCAACAGCATCCCGTGCAACAGGTTCTCACCCCTTGGTTTGTTCCTGGCC 219 

ratTaslr3 C -AGCCCAACGGCATCCTATGTACCAGGTTCTCGCCCCTTGGTTTGTTCCTGGCC 219 

catTaslr3 C AGCCCAATGCCACCGTGTGCACCAGGTTCTCGTCTCTGGGCCTGCTCTGGGCG 231 

humanTASlR3 C GGCCCAGCAGCCCTGTGTGCACCAGGTTCTCCTCAAACGGCCTGCTCTGGGCA 219 

* * ** 



mouseTaslr2 CAGGCCATGCGATTCGCCGTGGAGGAAATCAACAACTGTAGCTCTCTGCTGCCCGGCGTG 285 

ratTaslr2 CAGGCCATGCGTTTCGCTGTGGAGGAGATCAACAACTGTAGCTCCCTGCTACCCGGCGTG 285 

humanTAS 1R2 CAGGCCATGCGCTTCGCGGTGGAGGAGATCAACAATGACAGCAGCCTGCTGCCTGGTGTG 276 

catTas lr2 CAGGCCATGTGCTTTGCAGGGGAGGAGATCAATAGCCAGAGCAGCCTGCTGCCTGGCGTG 276 

mouseTaslrl CAAGCCATGCGGTTCACCGTTGAGGAGATAAACAACTCCACAGCTCTGCTTCCCAACATC 291 

ratTaslrl ' C AAGCC ATGCGGTTCACTGTTGAGGAGATAAACAACTCCTCGGCCCTGCTTCCCAACATC 285 

humanTASlRl CAGGCTATGCGGCTTGGGGTTGAGGAGATAAACAACTCCACGGCCCTGCTGCCCAACATC 288 

ca tTas lr 1 CAGGCCATGCGGTTTGGCATCGAGGAGATAAACAACTCCACGGCCCTCCTGCCGAACGTC 2 88 

mouseTaslr3 ATGGCTATGAAGATGGCTGTGGAGGAGATCAACAATGGATCTGCCTTGCTCCCTGGGCTG 279 

ratTaslr3 ATGGCTATGAAGATGGCTGTAGAGGAGATCAACAATGGATCTGCCTTGCTCCCTGGGCTG 279 

catTas lr 3 CTGGCCGTGAAGATGGCGGTGGAGGAGATCAACAACGGGTCGGCCCTGCTGCCCGGGCTG 2 91 

humanTAS 1R3 CTGGCCATGAAAATGGCCGTGGAGGAGATCAACAACAAGTCGGATCTGCTGCCCGGGCTG 279 



***** ** ** * 



* * * ** 

Figure 1B

CTGCTCGGCTACGAGATGGTGGATGTCTGCTACCTCTCC AACAATATCCAGCCTGGG 342

CTGCTCGGCTACGAGATGGTGGATGTCTGTTACCTCTCC AACAATATCCACCCTGGG 342 

CTGCTGGGCTATGAGATCGTGGATGTGTGCTACATCTCC AACAATGTCCAGCCGGTG 333 

CTGCTGGGCTACAAAATGGTGGATGTCAGCTACATCTCC AACAATGTCCAGCCCGTG 333 

ACCCTGGGGTATGAACTGTATGACGTGTGCTCAGAGTCT TCCAATGTCTATGCCACC 348 

ACCCTGGGGTATGAGCTGTACGACGTGTGCTCAGAATCT GCCAATGTGTATGCCACC 3 42 

ACCCTGGGGTACCAGCTGTATGATGTGTGTTCTGACTCT GC CAAT GTGT AT GCCACG 345 

ACCCTGGGATACCAGCTGTACGACGTGTGCTCGGAGTCT GCCAACGTGTATGCCACA 345 

CGGCTGGGCTATGACCTATTTGACACATGCTCCGAGCCAGTGGTCACCATGAAATCCAGT 339 
CGACTGGGCT ATGACCTGTTTGACACATGCTCAGAGCCAGTGGTCACCATGAAGCCCAGC 339 
CACCTGGGCTATGACCTCTTTGACACGTGTTCAGAGCCCATGGTGGCCATGAAGCCCAGC 351 
CGCCTGGGCT ACGACCTCTTTGATACGTGCTCGGAGCCTGTGGTGGCC ATGAAGCCCAGC 339 



mouseTaslr2 CTCTACTTCCTGTC ACAGATAGATGACTTCCTGCCCATCCTCAAAGACTACAGCCAG 399 

ratTas lr2 . CTCTACTTCCTGGC ACAGGACGACGACCTCCTGCCCATCCTCAAAGACTACAGCCAG 399 

humanTAS 1R2 CTCTACTTCCTGGC ACACGAGGACAACCTCCTTCCCATCCAAGAGGACTACAGTAAC 390 

catTaslr2 CTCCACTTCCCGGC AAAGGAGGACTGTTCCTTGCCCATCCAGGAGGACTACAGCCAC 3 90 

mouse Tas lr 1 CTGAGGGTGCTCGCCCAGCAAGGGACAGGCCACCTAGAGATGCAGAGAGATCTTCGCAAC 4 08 

ratTas 1 r 1 CTGAGGGTGCTTGCCCTGCAAGGGCCCCGCCACATAGAGATACAGAAAGACCTTCGCAAC 402 

humanTASlRl CTGAGAGTGCTCTCCCTGCCAGGGCAACACCACATAGAGCTCCAAGGAGACCTTCTCCAC 405 

catTas lrl CTAAACGTGCTCTCCCTGCTGGGGACACATCACGTAGAGATCCGAGCAGACCCTTCCCAC 405 

mouseTas lr3 CTCATGTTCCTGGCCAAGGTGGGCAGTCAAAGCATTGCTGCCTACTGCAACTACACACAG 399 

ratTaslr3. CTCATGTTCATGGCCAAGGTGGGAAGTCAAAGCATTGCTGCCTACTGCAACTACACACAG 399 

catTaslr3 CTCGTGTTCATGGCCAAAGCAGGCAGCTGCAGCATTGCCGCCTACTGCAATTACACACAG 411 

humanTAS 1 R3 CTC ATGTTCCTGGCCAAGGCAGGC AGCCGCGACATCGCCGCCTACTGCAACTACACGC AG 399 
** * * * * * * 

mouseTaslr2 ■ TACAGGCCCCAAGTGGTGGCCGTCATTGGCCCAGACAACTCTGAGTCCGCCATCACCGTG 459 

ratTas lr2 TACATGCCCCACGTGGTGGCTGTCATTGGCCCCGACAACTCTGAGTCCGCCATTACCGTG 459 

humanTAS 1R2 TACATTTCCCGTGTGGTGGCTGTCATTGGCCCTGACAACTCCGAGTCTGTCATGACTGTG 4 50 

catTas 1 r2 TGTGTGCCCCGTGTGGTGGCTGTCATTGGTCCTGGCAACTCTGAGTCCACTGTGACTGTG 4 50 

mouseTaslrl CACTCCTCCAAGGTGGTGGCACTCATTGGGCCTGATAACACTGACCACGCTGTCACCACT 4 68 
ratTaslrl . . CACTCCTCCAAGGTGGTGGCCTTCATCGGGCCTGACAACACTGACCACGCTGTCACTACC 4 62 

humanTASlRl TATTCCCCTACGGTGCTGGCAGTGATTGGGCCTGACAGCACCAACCGTGCTGCCACCACA 4 65 

catTaslrl TATTCGCCTGCCGCCCTGGCTGTCATTGGGCCTGACACCACCAACCACGCAGCCACCACT 4 65 

mouseTaslr3 TACCAACCCCGTGTGCTGGCTGTCATCGGCCCCCACTCATCAGAGCTTGCCCTCATTACA 459 

ratTaslr3 TACCAACCCCGTGTGCTGGCTGTCATTGGTCCCCACTCATCAGAGCTTGCCCTCATTACA 459 

catTas lr3 TACCAGCCCCGCGTGCTGGCCGTCATCGGGCCCCACTCGTCTGAGCTCGCCCTCGTCACC 471 

humanTAS 1R3 TACCAGCCCCGTGTGCTGGCTGTCATCGGGCCCCACTCGTCAGAGCTCGCCATGGTCACC 459 

■* * **** * * ■*• ■*•+ +■* * * 

mouseTas 1 r2 TCCAACATTCTCTCCTACTTCCTCGTGCCACAGGTCACATATAGCGCCATC ACCGACAAG 519 

ratTas lr2 TCCAACATTCTCTCTCATTTCCTCATCCCACAGATCACATACAGCGCCATCTCCGACAAG 519 

humanTASlR2 GCCAACTTCCTCTCCCTATTTCTCCTTCCACAGATCACCTACAGCGCCATCAGCGATGAG 510 

catTaslr2 GCCCGCTTCCTCTCTCTCTTCCTCCTTCCACAGATCACCTACAGCGCCATCAGTGACGAG 510 

mouseTaslrl GCTGCCCTGCTGAGCCCTTTTCTGATGCCCCTGGTCAGCTATGAGGCGAGCAGCGTGATC 528 

ratTas 1 r 1 GCTGCCTTGCTGGGTCCTTTCCTGATGCCCCTGGTCAGCTATGAGGCAAGC AGCGTGGTA 522 

humanTASlRl GCCGCCCTGCTGAGCCCTTTCCTGGTGCCCATGATTAGCTATGCGGCCAGCAGCGAGACG 525 

catTaslrl GCAGCCCTGCTGAGCCCCTTCCTGGTGCCCCTGATCAGCTACGAGGCCAGCAGCGTGACG 525 

mouseTas 1 r3 GGCAAGTTCTTCAGCTTCTTCCTCATGCCACAGGTCAGCTATAGTGCCAGCATGGATCGG 519 
ratTaslr3 . GGCAAGTTCTTCAGCTTCTTCCTCATGCCACAGGTCAGCTATAGTGCCAGCATGGATCGG 519 
catTas lr3 ■* GGCAAGTTCTTCAGCTTCTTCCTTGTGCCTCAGGTCAGCTACGGCGCCAGCACCGACCGG 531 

humanTAS 1 R3 GGCAAGTTCTTCAGCTTCTTCCTCATGCCCCAGGTCAGCTACGGTGCTAGC ATGGAGCTG 519 



mouseTas lr2 CTGCGAGACAAGCGGCGCTTCCCTGCCATGCTGCGCACTGTGCCCAGCGCCACCCACCAC 57 9 

ratTas lr2 CTGCGGGACAAGCGGCACTTCCCTAGCATGCTACGCACAGTGCCCAGCGCCACCCACCAC 57 9 

humanTASlR2 CTGCGAGACAAGGTGCGCTTCCCGGCTTTGCTGCGTACCACACCCAGCGCCGACCACCAC 570 

catTaslr2 CTACGGGACAAGCAGCGCTTCCCGGCCCTTCTGCCCACAGCGCCGGGCGCCGATCACCAG 570 
mouseTaslrl . . CTCAGTGGGAAGCGCAAGTTCCCGTCCTTCTTGCGCACCATCCCCAGCGATAAGTACCAG 588 

ratTaslrl CTCAGTGCCAAGCGCAAGTTCCCGTCTTTCCTTCGTACCGTCCCCAGTGACCGGCACCAG 582 

humanTASlRl CTCAGCGTGAAGCGGCAGTATCCCTCTTTCCTGCGCACCATCCCCAATGACAAGTACCAG 585 

catTaslrl CTCGGAGTGAAGCGGCATTACCCCTCGTTTCTGCGCACCATCCCCAGCGACAAGCACCAG 585 

mouseTaslr3 CTAAGTGACCGGGAAACGTTTCCATCCTTCTTCCGCACAGTGCCCAGTGACCGGGTGCAG 579 

ratTaslr3 CTAAGTGACCGGGAAACATTTCCATCCTTCTTCCGCACAGTGCCCAGTGACCGGGTGCAG 579 

catTaslr3 CTGAGCAACCGGGAGATCTTCCCGTCCTTCTTCCGCACGGTGCCCAGCGACCAGGTGCAG 591 

humanTASlR3 CTGAGCGCCCGGGAGACCTTCCCCTCCTTCTTCCGCACCGTGCCCAGCGACCGTGTGCAG 579 



Figure 1C 
mouseTas lr2 ATCGAGGCCATGGTGCAACTGATGGTTCACTTCCAGTGGAACTGGATCGTGGTGCTGGTG 63 9

ratTaslr2 ATCGAGGCCATGGTGCAGCTGATGGTTCACTTCCAATGGAACTGGATTGTGGTGCTGGTG 63 9 

humanTAS 1R2 GTCGAGGCCATGGTGCAGCTGATGCTGCACTTCCGCTGGAACTGGATCATTGTGCTGGTG 630 

catTas lr2 ATCGAGGCCATGGTGCAGCTGATGTTGTACTTCCGCCGGAACTGGATCATCGCGCTGGTG 630 

mouseTas lrl GTGGAAGTCATAGTGCGGCTGCTGCAGAGCTTCGGCTGGGTCTGGATCTCGCTCGTTGGC 648 

ratTaslrl GTGGAGGTCATGGTGCAGCTGCTGCAGAGTTTTGGGTGGGTGTGGATCTCGCTCATTGGC 642 

humanTAS 1 Rl GTGGAGACCATGGTGCTGCTGCTGCAGAAGTTCGGGTGGACCTGGATCTCTCTGGTTGGC 64 5 

catTaslrl GTGGAGGCCATGGTGCTGCTGCTGCAGAGCTTCGGGTGGGTCTGGATCTCGGTGGTCGGC 64 5 

mouseTas lr3 CTGCAGGCAGTTGTGACTCTGTTGCAGAACTTCAGCTGGAACTGGGTGGCCGCCTTAGGG 639 

ratTaslr3 . CTGCAGGCCGTTGTGACACTGTTGCAGAATTTCAGCTGGAACTGGGTGGCTGCCTTAGGT 639 

catTas lr3 GTGGCGGCCATGGTGGAGCTGCTGGAGGAGCTCGGCTGGAACTGGGTGGCGGCGGTGGGT 651 

humanTAS 1R3 CTGACGGCCGCCGCGGAGCTGCTGCAGGAGTTCGGCTGGAACTGGGTGGCCGCCCTGGGC 639 

mouseTas lr2 AGCGATGACGATTATGGCCGAGAGAACAGCCACCTGCTGAGCCAGCGTCTGACCAACACT 699 

ratTas lr2 AGCGACGACGATTACGGCCGCGAGAACAGCCACCTGTTGAGCCAGCGTCTGACCAAAACG 699 

humanTAS 1R2 AGCAGCGACACCTATGGCCGCGACAATGGCCAGCTGCTTGGCGAGCGCGTGGCCCGG 687 

catTas lr2 AGCAGCGGCGACTGCGGCCGCGACGACAGCCAGCTGCTCAGCGATCGCCCGGCCGGC 687 

mouseTas lrl AGCTATGGTGACTACGGGCAGCTGGGCGTACAGGCGCTGGAGGAGC TGGCCACTCCA 705 

ratTaslrl AGCTACGGTGATTACGGGCAGCTGGGTGTGCAGGCGCTGGAGGAGC TGGCCGTGCCC 699 

humanTAS 1R1 AGCAGTGACGACTATGGGCAGCTAGGGGTGCAGGCACTGGAGAACC AGGCCACTGGT 702 

CatTaslrl AGCGACGGCGACTACGGGCAGCTGGGGGTGCAGGCGCTGGAGGAGC AGGCCACCCAG 702 

mouseTas lr3 AGTGATGATGACTATGGCCGGGAAGGTCTGAGCATCTTTTCTAGTC TGGCCAATGCA 696 

ratTas lr 3. AGTGATGATGACTATGGCCGGGAAGGTCTGAGCATCTTTTCTGGTC TGGCCAACTCA 696 

catTaslr3 AGTGACGACGAGTATGGCCGGCAGGGCCTGAGCCTCTTCTCCGGCC TGGCCAGCGCC 708 

humanTAS 1R3 AGCGACGACGAGTACGGCCGGCAGGGCCTGAGCATCTTCTCGGCCC TGGCCGCGGCA 69 6 

******* * * * ** 

mouseTas lr2 GGCGATATCTGCATTGCCTTCCAGGAGGTTCTGCCTGTACCAGAACCCAACCAGGCCGTG 759 

ratTas 1 r2 AGCGACATCTGCATTGCCTTCCAGGAGGTTCTGCCCATACCTGAGTCCAGCCAGGTCATG 7 59 

humanTASlR2 CGCGACATCTGCATCGCCTTCCAGGAGACGCTGCCCACACTGCAGCCCAACCAGAACATG 747 . 

catTas 1 r2 GGCGACACCTGCATCGCCTTCCGGGAGACGCTGCCCATGCCCCAGCCCAACCAGGCGGTG 747 

mouseTaslrl CGGGGCATCTGCGTCGCCTTCAAGGACGTGGTGCCTCT--CTCCGCCCAGGCGGGTGACC 7 63 

ratTaslrl CGGGGCATCTGCGTCGCCTTCAAGGACATCGTGCCTTT — CTCTGCCCGGGTGGGTGACC 757 

humanTAS 1R1 CAGGGGATCTGCATTGCTTTCAAGGACATCATGCCCTT— CTCTGCCCAGGTGGGCGATG 760 

catTaslrl CAGGGCATCTGCGTTGCCTTCAAGGACATCATCCCCTT — CTCTGCCCGGCCGGGCGACG 7 60 

mouseTas 1 r3 CGAGGTATCTGCATCGCACATGAGGGCCTGGTGCCACAA-CATGACACTAGTGGCCAACA 7 55 

ratTas lr3 CGAGGTATCTGCATTGCACACGAGGGCCTGGTGCCACAA-CATGACACTAGTGGCCAACA 755 

catTaslr3 AGGGGCATCTGCATCGCGCATGAGGGCCTGGTGCCACTG-C-CGCCA — GGCAGCCTGCG 7 64 

humanTASlR3 CGCGGCATCTGCATCGCGCACGAGGGCCTGGTGCCGCTG-CCCCGTGCCGATGACTCGCG 7 55 
********* ** * ** 

mouseTaslr2 AGGCCTGAGGAGCAGGACCAACTGGACAACATCCTGGACAAGCTGCGGC GGACCTCG 816 

ratTaslr2 AGGTCCGAGGAGCAGAGACAACTGGACAACATCCTGGACAAGCTGCGGC GGACCTCG 816 

humanTAS 1R2 ACGTCAGAGGAGCGCCAGCGCCTGGTGACCATTGTGGACAAGCTGCAGC AGAGCACA 804 

catTas lr2 ACGCAGTGGGAGCGCCGGCGCCTGAAGGCCATCGTGGACGAGCAGCAGCGGCAGAGCTCT 807 

mouseTaslrl C AAGGATGCAGCGCATGATGCTGCGTCTGGCTCGAGCCA GGACCACC 810 

ratTaslrl C -GAGGATGCAGAGCATGATGCAGCATCTGGCTCAGGCCA GGACCACC 804 

humanTASlRl A GAGGATGCAGTGCCTCATGCGCCACCTGGCCCAGGCCG GGGCCACC 807 

catTaslrl A G AGG AT GC AG AGC AT C ATGC ACC AC CT GGCC CG AGCG A GGACCACC 807 

mouseTas! r3 G TTGGGCAAGGTGCTGGATGTACTACGCCAAGTGAACCA AAGTAAA 801 

ratTas lr3 * A --TTGGGCAAGGTGGTGGATGTGCTACGCCAAGTGAACCA AAGCAAA 8 01 

catTaslr3 G CTGGGCGCCCTACAGGGCCTGCTGCGCCAGGTGAACCA GAGCAGC 810 

humanTAS 1 R3 G CTGGGG AAGGTGC AGGACGTCCTGC ACC AGGTGAACC A- GAGCAGC 801 



mouseTaslr2 GCGCGTGTGGTGGTGATATTCTCGCCAGAGCTGAGCCTGCACAACTTCTTCCGCGAGGTG 876 

ratTas lr2 GCGCGCGTCGTGGTGGTGTTCTCGCCCGAGCTGAGCCTGTATAGCTTCTTTCACGAGGTG 87 6 

humanTAS 1R2 GCGCGCGTCGTGGTCGTGTTCTCGCCCGACCTGACCCTGTACCACTTCTTCAATGAGGTG 8 64 

catTaslr2 GCGCGCGTCGTGGTCCTGCTGTCGCCAAAGCTGGTCCTGCACAACTTCTTCCGCGAGGTG 867 

mouseTaslrl GTG GTCGTGGTCTT-CTCTAACCGGCACCTGGCTGGAGTG — TTCTTCAGGTCTGTG 864 

ratTaslrl GTG GTTGTGGTCTT-CTCTAACCGGCACCTGGCTAGAGTG — TTCTTCAGGTCCGTG 858 

humanTASlRl GTC GTGGTTGTTTT-TTCCAGCCGGCAGTTGGCCAGGGTG— TTTTTCGAGTCCGTG 8 61 

catTaslrl GTT GTGGTCGTTTT-CTCCAGCAGGCAGCTGGCCAGGGTG — TTCTTTGAGTCGGTG 8 61 

mouseTaslr3 GTACAAGTGGTGGTGCTGTTTGCCTCTGCCCGTGCTGTCTACTCCCTTTTTAGTTACAGC 8 51 

ratTaslr3 GTACAGGTGGTGGTGCTGTTTGCATCTGCCCGTGCTGTCTACTCCCTTTTTAGCTACAGC 8 61 

catTas 1 r 3 GTGCAGGTGGTGGTGCTGTTCTCCTCCGCCCACGCGGCCCGCACCCTCTTCAGCTACAGC 870 

humanTAS 1R3 GTGCAGGTGGTGCTGCTGTTCGCCTCCGTGCACGCCGCCCACGCCCTCTTCAACTACAGC 8 61 

CTGCGCTGGAACTTCACAGGCTTTGTGTGGATTGCCTCTGAGTCCTGGGCC ATCGACCCT 936
CTCCGCTGGAACTTCACGGGTTTTGTGTGGATCGCCTCTGAGTCCTGGGCT ATCGACCCA 936 
CTGCGCCAGAACTTCACGGGCGCCGTGTGGATCGCCTCCGAGTCCTGGGCCATCGACCCG 924 
CTCCGCCAGAACCTCACGGGCGTCGTGCGGATCGCCTCCGAGTCCTGGGCCATCGACCCG 927 
GTGCTGGCCAACCTGACTGGCAAAGTGTGGATCGCCTCCGAAGACTGGGCCATCT-CCAC 923 
GTGCTGGCCAACCTGACTGGCAAAGTGTGGGTCGCCTCAGAAGACTGGGCCATCT-CCAC 917 
GTGCTGACCAACCTGACTGGCAAGGTGTGGGTCGCCTCAGAAGCCTGGGCCCTCT-CCAG 920 
GTGCTGGCCAACCTGACTGCCAAGGTGTGGATCGCCTCAGAAGACTGGGCCATCT-CTAG 920 
ATCCATCATGGCCTCTCACCCAAGGTATGGGTGGCCAGTGAGTCTTGGCTGACAT-CTGA 920 
ATCCTTCATGACCTCTCACCCAAGGTATGGGTGGCCAGTGAGTCCTGGCTGACCT-CTGA 920 
ATCCGCTGCAAGCTCTCACCCAAGGTGTGGGTGGCCAGCGAGGCCTGGCTGACCT-CAGA 929 
ATCAGCAGCAGGCTCTCGCCCAAGGTGTGGGTGGCCAGCGAGGCCTGGCTGACCT-CTGA 920 
* * * ** ** * *** ** *** * 

GTTCTACACAAC CTCACAGAGCTGCGCCACACGGGCACTTTCCTGGGCGTCACCA 991 

GTTCTGCATAAC CTCACGGAGCTGCGCCACACGGGTACTTTTCTGGGCGTCACCA 991 

GTCCTGCACAAC CTCACGGAGCTGGGCCACTTGGGCACCTTCCTGGGCATCACCA 979

GTCCTGCACGACAGGCCCACGCGCTGCACAGCCTCCTGGGCTGCACCCAGACCAGCAGC- 986 

GTACATCACCAA TGTGCCCGGGATCCAGGGCATTGGGACGGTGCTGGGGGTGGCCA 979

GTACATCACCAG CGTGACTGGGATCCAAGGCATTGGGACGGTGCTCGGTGTGGCCG 973 

GCACATCACTGG GGTGCCCGGGATCCAGCGCATTGGGATGGTGCTGGGCGTGGCCA 976 

ACACATCAGCAA TGTGCCCGGGATCCAGGGCATTGGCACGGTGCTGGGTGTGGCCA 976 

CCTGGTCATGAC ACTTCCCAATATTGGCCGTGTGGGCACTGTGCTTGGGTTTTTGC 976 

CCTGGTCATGAC ACTTCCCAATATTGCCCGTGTGGGCACTGTTCTTGGGTTTCTGC 976 

CCTGGTCATGAC GCTGCCCGGCATGCCTGGGGTGGGCACCGTGCTGGGCTTCCTGC 985 

CCTGGTCATGGG GCTGCCCGGCATGGCCCAGATGGGCACGGTGCTTGGCTTCCTCC 976 

* * * * * * 

TCCAGAGGGTGTCCATCCCTGGCTTCAGCCAGTTCCGAGTGCGCCAC GACAAGCCAG 1048

TCCAGAGGGTGTCCATCCCTGGCTTCAGTCAGTTCCGAGTGCGCCGT GACAAGCCAG 1048

TCCAGAGCGTGCCCATCCCGGGCTTCAGTGAGTTCCGCGAGTGGGGC CCACAGGCTG 1036

TCCGGGTCGT- -CTATCCCTGGCA- -GGTGAGGCCC CAC CCACGGA— G 1029 

TCCAGCAGAGACAAGTCCCTGGCCTGAAGGAGTTTGAAGAGTCCTAT GTCCAGGCAG 1036

TCCAGCAGAGACAAGTCCCTGGGCTGAAGGAGTTTGAGGAGTCTTAT GTCAGGGCTG 1030 

TCCAGAAGAGGGCTGTCCCTGGCCTGAAGGCGTTTGAAGAAGCCTAT GCCCGGGCAG 1033 

TCCAGCAGAGGCTTGTCCCTGGCCTGAAGGAGTTTGAAGAGGCCTAT GTCCAGGCAG 1033

AGCGGGGTGCCCTACTGCCTGAATTTTCCCATTATGTGGAGACTCACCTTGCCCTGGCCG 1036 
AGCGCGGTGCCCTACTGCCTGAATTTTCCCATTATGTGGAGACTCGCCTTGCCCTAGCTG 1036 
AGCAGGGCGCCCCGATGCCGGAGTTCCCATCCTACGTGCGGACCCGCCTGGCCCTGGCCG 1045 
AGAGGGGTGCCCAGCTGCACGAGTTCCCCCAGTACGTGAAGACGCACCTGGCCCTGGCCA 1036 



AGT AT CC C AT GC C T A- - ACG AG AC C AGC CT G - AGGACT ACCTG- TAACC AG 1095 

GGTATCCCGTGCCTA— ACACGACCAACCTG CGGACGACCTG-CAACCAG 1095 

GGCCGCCACCCCTCA — GCAGGACCAGCCAG AGCTATACCTG-CAACCAG 1083

AGTCGGGGCCACACAC-GCAGGCGCCGCCAC- — AGCCCTGAGTGGTTGCCAT 1078

TGATGGGTGCTCCCAGAACTTGCCCAGAGGG GTCCTGGTGCGGCACTAAC 1086

TAACAGCTGCTCCCAGCGCTTGCCCGGAGGG GTCCTGGTGCAGCACTAAC 1080

ACAAGAAGGCCCCTAGGCCTTGCCACAAGGG CTCCTGGTGCAGCAGCAAT 1083 

ATAAGGGGGCCCCTGGGCCTTGCTCCAGGAC CTCCGAGTGCAGCAGCAAC 1083 

CTGACCCAGCATTCTGTGCCTCACTGAATGCGGA GTTGGATCTGGAGGAACATGTGA 1093 

CTGACCCAACATTCTGTGCCTCCCTGAAAGCTGA GTTGGATCTGGAGGAGCGCGTGA 1093 

CTGACCCTGCCTTCTGCGCCTCGCTGGACGCTGAACAGCCAGGCCTGGAGGAGCACGTGG 1105

CCGACCCGGCCTTCTGCTCTGCCCTGGGCGAGAGGGAGCAGGGTCTGGAGGAGGACGTGG 1096



GACTGTGACGCC — TGCATGAACATCACCGAGTCCTTTAACAACGTTCTCATGCTTT 1150 

GACTGTGACGCC — TGCTTGAACACCACCAAGTCCTTCAACAACATCCTTATACTTT 1150 

GAGTGCGACAAC--TGCCTGAACGCCACCTTGTCCTTCAACACCATTCTCAGGCTCT 1138

GGAGACCACTGCCCTGCTCTAGCGTCCCCCTCTCTGGCCGGGTCCTGGGCAAACTGG 1135 

C — AGCTGTGCAGGGAGTGTCACGCTTTCACGACATGGAACATGCCCGAGCTTGGAGCCT 1144
C — AGCTGTGCCGGGAGTGCCACACGTTCACGACTCGTAACATGCCCACGCTTGGAGCCT 1138
C — AGCTCTGCAGAGAATGCCAAGCTTTCATGGCACACACGATGCCCAAGCTCAAAGCCT 1141
C — AGCTCTGTAGAGAGTGTCGGGCTTTCACGGCAGAGCAGATGCCCACGCTCGGGGCAT 1141 
TGGGGCAACGCTGTCCACGGTGTGACGACATCATGCTGCAGAACCTATCATCTGGGCTGT 1153 
TGGGGCCACGCTGTTCACAATGTGACTACATCATGCTACAGAACCTGTCATCTGGGCTGA 1153 
TGGGGCCACGCTGCCCCCAATGTGACCACGTCACGCTAGAGAACCTATCTGCGGGGCTG- 1164 
TGGGCCAGCGCTGCCCGCAGTGTGACTGCATCACGCTGCAGAACGTGAGCGCAGGGCTAA 1156 
Stop codon in cat T1R2 V
CG GGGGAGCGTGTGGTCTACAGTGTGTACTCGGCCGTCT 1189
CG GGGGAGCGCGTGGTCTACAGCGTGTACTCGGCAGTTT 1189
CT GGGGAGCGTGTCGTCTACAGCGTGTACTCTGCGGTCT 1177
CG GGAGAGGCCAGGGGACGTACCCTGTCCCCAGACACAT 1174
TC TCCATGAGCGCTGCCTACAATGTGTATGAGGCTGTGT 1183
TC TCCATGAGTGCCGCCTACAGAGTGTATGAGGCTGTGT 1177
TC TCCATGAGTTCTGCCTACAACGCATACCGGGCTGTGT 1180
TC TCCATGAGCTCTGCTTATAACGCCTACCGGGCAGTCT 1180
TGCAGAACCTATCAGCTGGGCAATTGCACCACCAAATATTTGCAACCTATGCAGCTGTGT 1213
TGCAGAACCTATCAGCTGGGCAGTTGCACCACCAAATATTTGCAACCTATGCAGCTGTGT 1213
--CTGCACCACCAGACCTTCGCTGCCTACGCGGCTGTGT 1201
ATCACCACCAGACGTTCTCTGTCTACGCAGCTGTGT 1192



mouseTaslr2 ACGCGGTAGCCCACACCCTCCACAGACTCCTCCACTGCAACCAGGTCCGCTGCACCA 1246

ratTaslr2 ACGCGGTGGCCCATGCCCTCCACAGACTCCTCGGCTGTAACCGGGTCCGCTGCACCA 1246 

humanTAS1R2 ATGCTGTGGCCCATGCCCTGCACAGCCTCCTCGGCTGTGACAAAAGCACCTGCACCA 1234

catTaslr2 AA : : 1176 

mouseTas1r1 ATGCTGTGGCCCACGGCCTCCACCAGCTCCTGGGATGTACCTCTGGGACCTGTGCCA 1240

ratTaslrl ACGCTGTGGCCCACGGCCTCCACCAGCTCCTGGGATGTACTTCTGAGATCTGTTCCA 1234 

humanTASlRl ATGCGGTGGCCCATGGCCTCCACCAGCTCCTGGGCTGTGCCTCTGGAGCTTGTTCCA 1237 

catTaslrl ACGCAGTGGCCCATGGCCTCCACCAGCTCCTGGGCTGTGCCTCTGGAGCCTGTTCCA 1237 

mouseTaslr3 ACAGTGTGGCTCAAGCCCTTCACAACACCCTACAGTGCAATGTCTCACATTGCCACGTAT 1273

ratTaslr3 ACAGTGTGGCTCAGGCCCTTCACAACACCCTGCAGTGCAATGTCTCACATTGCCACACAT 1273 

catTas1r3 ATGGCGTGGCCCAAGCCCTTCACAACACACTGCGCTGCAATGCCTCGGGCTGCCCCAGGC 1261

humanTAS1R3 ATAGCGTGGCCCAGGCCCTGCACAACACTCTTCAGTGCAACGCCTCAGGCTGCCCCGCGC 1252



mouseTaslr2 AGCAAATCGTCTATCCATGGCAGCTACTCAGGGAGATCTGGCATGTCAACTTCACGCTCC 1306
ratTaslr2 AGCAAAAGGTCTACCCGTGGCAGCTACTCAGGGAGATCTGGCACGTCAACTTCACGCTCC 1306
humanTAS1R2 AGAGGGTGGTCTACCCCTGGCAGCTGCTTGAGGAGATCTGGAAGGTCAACTTCACTCTCC 1294
catTaslr2
mouseTaslrl GAGGCCCAGTCTACCCCTGGCAGCTTCTTCAGCAGATCTACAAGGTGAATTTCCTTCTAC 1300
ratTaslrl GAGGCCCAGTCTACCCCTGGCAGCTTCTTCAGCAGATCTACAAGGTGAATTTTCTTCTAC 1294
humanTASlRl GGGGCCGAGTCTACCCCTGGCAGCTTTTGGAGCAGATCCACAAGGTGCATTTCCTTCTAC 1297
catTaslrl GGGACCGAGTCTACCCCTGGCAGCTTCTGGAGCAGATCCGCAAGGTGAATTTCCTCCTAC 1297
mouseTaslr3 CAGAACATGTTCTACCCTGGCAGCTCCTGGAGAACATGTACAATATGAGTTTCCATGCTC 1333
ratTaslr3 CAGAGCCTGTTCAACCCTGGCAGCTCCTGGAGAACATGTACAATATGAGTTTCCGTGCTC 1333
catTaslr3 GGGAGCCTGTGCGGCCCTGGCAGCTCCTAGAGAACATGTACAACGTGAGCTTCCGTGCTC 1321
humanTAS1R3 AGGACCCCGTGAAGCCCTGGCAGCTCCTGGAGAACATGTACAACCTGACCTTCCACGTGG 1312



mouseTaslr2 TGGGCAACCAGCTCTTCTTCGACGAACAAGGGGACATGCCGATGCTCCTGGACATCATCC 1366

ratTas1r2 TGGGTAACCGGCTCTTCTTTGACCAACAAGGGGACATGCCGATGCTCTTGGACATCATCC 1366

humanTAS1R2 TGGACCACCAAATCTTCTTCGACCCGCAAGGGGACGTGGCTCTGCACTTGGAGATTGTCC 1354

catTaslr2

mouseTaslrl ATAAGAAGACTGTAGCATTCGATGACAAGGGGGACCCTCTAGGTTATTATGACATCATCG 1360

ratTaslrl ATGAGAATACTGTGGCATTTGATGACAACGGGGACACTCTAGGTTACTACGACATCATCG 1354

humanTASlRl ACAAGGACACTGTGGCGTTTAATGACAACAGAGATCCCCTCAGTAGCTATAACATAATTG 1357

catTas1r1 ACAAGGACACCGTGAGGTTTAATGACAACGGGGACCCTCTCAGTGGCTACGACATAATTG 1357

mouseTaslr3 GAGACTTGACACTACAGTTTGATGCTGAAGGGAATGTAGACATGGAATATGACCTGAAGA 1393

ratTaslr3 GAGACTTGACACTGCAGTTTGATGCCAAAGGGAGTGTAGACATGGAATATGACCTGAAGA 1393

catTaslr3 GCGGCCTGGCACTGCAGTTCGACGCCAGCGGGAACGTGAACGTGGATTACGACCTGAAAC 1381

humanTAS1R3 GCGGGCTGCCGCTGCGGTTCGACAGCAGCGGAAACGTGGACATGGAGTACGACCTGAAGC 1372



mouseTaslr2 AGTGGCAATGGGGCCTGAGCCAGAACCCCTTCCAAAGCATCGCCTCCTACTCCCCCACCG 1426

ratTaslr2 AGTGGCAGTGGGACCTGAGCCAGAATCCCTTCCAAAGCATCGCCTCCTATTCTCCCACCA 1426

humanTAS1R2 AGTGGCAATGGGACCGGAGCCAGAATCCCTTCCAGAGCGTCGCCTCCTACTACCCCCTGC 1414

catTaslr2

mouseTaslrl CCTGGGACTGGAATGGACCTGAATGGACCTTTGAGGTCATTGGTTCTGCCTCACTGTCTC 1420

ratTaslrl CCTGGGACTGGAATGGACCTGAATGGACCTTTGAGATCATTGGCTCTGCCTCACTGTCTC 1414

humanTASlRl CCTGGGACTGGAATGGACCCAAGTGGACCTTCACGGTCCTCGGTTCCTCCACATGGTCTC 1417

catTaslrl CCTGGGACTGGAGTGGCCCCAAGTGGAACTTCAGGGTCATTGGCTCCTCCATGTGGCCTC 1417

mouseTaslr3 TGTGGGTGTGGCAGAGCCCTACACCTGTATTACATACTGTGGGCACCTTCAACGGCACCC 1453

ratTaslr3 TGTGGGTGTGGCAGAGCCCTACACCTGTACTACATACTGTAGGCACCTTCAACGGCACCC 1453

catTaslr3 TGTGGGTGTGGCAGGACCCGACGCCCGAGCTGCGCACCGTAGGCACCTTCAAGGGCCGCC 1441

humanTAS1R3 TGTGGGTGTGGCAGGGCTCAGTGCCCAGGCTCCACGACGTGGGCAGGTTCAACGGCAGCC 1432
Figure 1F
AGACGAGGCTGACCTACATTAG CAATGTGTCCTGGTACACCCCCAACAACACGGTCC 1483
GCAAGAGGCTAACCTACATTAA CAATGTGTCCTGGTACACCCCCAACAACACGGTCC 1483
AGCGACAGCTGAAGAACATCCA AGACATCTCCTGGCACACCGTCAACAACACGATCC 1471
CAGTTCATCTAGACATAAATAAGACAAAAATCCAGTGGCACGGGAAGAACAATCAGGTGC 1480
CAGTTCATCTGGACATAAATAAGACAAAAATCCAGTGGCACGGGAAGAACAATCAGGTGC 1474
CAGTTCAGCTAAACATAAATGAGACCAAAATCCAGTGGCACGGAAAGGACAACCAGGTGC 1477
CAGTTCAGCTGGACATAAATAAAACCAAAATCCGGTGGCACGGGAAGGACAACCAGGTGC 1477
TTCAGCTGCAGCAGTCTAA AATGTACTGGC — CAGGCAACCAGGTGC 1498
TTCAGCTGCAGCACTCGAA AATGTATTGGC CAGGCAACCAGGTGC 1498
TGGAGCTCTGGCGCTCTCA GATGTGCTGGCACACGCCGGGGAAGCAGCAGC 1492
TCAGGACAGAGCGCCTGAA GATCCGCTGGCACACGTCTGACAACCAGAAGC 1483
CCATATCCATGTGTTCTAAGAGTTGCCAGCCTGGGCAAATGAAAAAACCCATAGGCCTCC 1543
CTGTCTCCATGTGTTCCAAGAGCTGCCAGCCAGGGCAAATGAAAAAGTCTGTGGGCCTCC 1543
CTATGTCCATGTGTTCCAAGAGGTGCCAGTCAGGGCAAAAGAAGAAGCCTGTGGGCATCC 1531
CTGTGTCAGTGTGTACCAGGGACTGTCTCGAAGGGCACCACAGGTTGGTCATGGGTTCCC 1540
CTGTGTCAGTGTGTACCACGGACTGTCTGGCAGGGCACCACAGGGTGGTTGTGGGTTCCC 1534
CTAAGTCTGTGTGTTCCAGCGACTGTCTTGAAGGGCACCAGCGAGTGGTTACGGGTTTCC 1537
CAAAGTCTGTGTGCTCCAGCGACTGCCTCGAAGGGCACCAGCGAGTGATTTCGGGTTTCT 1537
CAGTCTCCCAGTGTTCCCGCCAGTGCAAAGATGGCCAGGTTCGCCGAGTAAAGGGCTTTC 1558
CAGTCTCCCAGTGCTCCCGGCAGTGCAAAGATGGCCAGGTGCGCAGAGTAAAGGGCTTTC 1558
CCGTGTCCCAGTGCTCCCGGCAGTGCAAGGAAGGCCAGGTGCGCCGCGTGAAGGGCTTCC 1552
CCGTGTCCCGGTGCTCGCGGCAGTGCCAGGAGGGCCAGGTGCGCCGGGTCAAGGGGTTCC 1543



mouseTaslr2 
ACCCGTGCTGCTTCGAGTGTGTGGACTGTCCGCCGGGCACCTACCTCAACCGATCAGTAG 1603
ACCCTTGTTGCTTCGAGTGCTTGGATTGTATGCCAGGCACCTACCTCAACCGCTCAGCAG 1603
ACGTCTGCTGCTTCGAGTGCATCGACTGCCTTCCCGGCACCTTCCTCAACCACACTGAAG 1591
ACCACTGCTGCTTCGAGTGCATGCCCTGTGAAGCTGGGACATTTCTCAAC ACGAGTG 1597
ACCACTGCTGCTTTGAGTGTGTGCCCTGCGAAGCTGGGACCTTTCTCAAC ATGAGTG 1591
ATCACTGCTGCTTTGAGTGTGTGCCCTGTGGGGCTGGGACCTTCCTCAAC AAGAGTG 1594
ACCACTGTTGCTTTGAGTGTGTGCCCTGTGAGGCCGGGAGCTTCCTCAAC AAGAGCG 1594
ATTCCTGCTGCTATGACTGCGTGGACTGCAAGGCGGGCAGCTACCGGAAG CATCCAG 1615
ATTCCTGCTGCTATGACTGTGTGGACTGCAAGGCAGGGAGCTACCGGAAG CATCCAG 1615
ACTCTTGCTGTTACAACTGCGTGGACTGCAAGGCGGGCAGTTATCAGCGC AACCCAG 1609
ACTCCTGCTGCTACGACTGTGTGGACTGCGAGGCGGGCAGCTACCGGCAA— AACCCAG 1600



mouseTaslr2 ATGAGTTTAACTGTCTGTCCTGCCCGGGTTCCATGTGGTCTTACAAGAACAACATCGCTT 1663 

ratTaslr2 ATGAGTTTAACTGTCTGTCCTGCCCGGGTTCCATGTGGTCCTACAAGAACGACATCACTT 1663

humanTAS1R2 ATGAATATGAATGCCAGGCCTGCCCGAATAACGAGTGGTCCTACCAGAGTGAGACCTCCT 1651

catTaslr2 : — ; 

mouseTaslrl AGCTTCACACCTGCCAGCCTTGTGGAACAGAAGAATGGGCCCGTGAGGGGAGCTCAGCCT 1657

ratTaslr1 AGCTTCACATCTGCCAGCCTTGTGGAACAGAAGAATGGGCACCCAAGGAGAGCACTACTT 1651

humanTASlRl ACCTCTACAGATGCCAGCCTTGTGGGAAAGAAGAGTGGGCACCTGAGGGAAGCCAGACCT 1654

catTaslrl ACCTCCACAGCTGCCAGCCTTGTGGGAAAGAAAAGTGGGCACCCGCGGGAAGTGAAACCT 1654

mouseTaslr3 ATGACTTCACCTGTACTCCATGTAACCAGGACCAGTGGTCCCCAGAGAAAAGCACAGCCT 1675 

ratTaslr3 ATGACTTCACCTGTACTCCATGTGGCAAGGATCAGTGGTCCCCAGAAAAAAGCACAACCT 1675

catTaslr3 ATGACCTCCTCTGCACCCAGTGTGACCAGGACCAGTGGTCCCCAGACCGGAGCACACGCT 1669 

humanTAS1R3 ACGACATCGCCTGCACCTTTTGTGGCCAGGATGAGTGGTCCCCGGAGCGAAGCACACGCT 1660



mouseTaslr2 GCTTCAAGCGGCGGCTGGCCTTCCTGGAGTGGCACGAAGTGCCCACTATCGTGGTGACCA 1723

ratTaslr2 GCTTCCAGCGGCGGCCTACCTTCCTGGAGTGGCACGAAGTGCCCACCATCGTGGTGGCCA 1723

humanTAS1R2 GCTTCAAGCGGCAGCTGGTCTTCCTGGAATGGCATGAGGCACCCACCATCGCTGTGGCCC 1711

catTaslr2 ■ 

mouseTaslrl GCTTCTCACGCACCGTGGAGTTCTTGGGGTGGCATGAACCCATCTCTTTGGTGCTATTAG 1717 

ratTaslrl GCTTCCCACGCACGGTGGAGTTCTTGGCTTGGCATGAACCCATCTCTTTGGTGCTAATAG 1711 

humanTASlRl GCTTCCCGCGCACTGTGGTGTTTTTGGCTTTGCGTGAGCACACCTCTTGGGTGCTGCTGG 1714 

catTaslrl GCTTTCCACGCACCGTGGTGTTTTTGACTTGGCACGAGACCATCTCTTGGGTGCTGCTGG 1714 

mouseTaslr3 GCTTACCTCGCAGGCCCAAGTTTCTGGCTTGGGGGGAGCCAGTTGTGCTGTCACTCCTCC 1735

ratTaslr3 GCTTACCTCGCAGGCCCAAGTTTCTGGCTTGGGGGGAGCCAGCTGTGCTGTCACTTCTCC 1735 

catTaslr3 GCTTCGCCCGCAAGCCCATGTTCCTGGCATGGGGGGAGCCAGCTGTGCTGCTACTGCTCG 1729 

humanTASlR3 GCTTCCGCCGCAGGTCTCGGTTCCTGGCATGGGGCGAGCCGGCTGTGCTGCTGCTGCTCC 1720 



Figure 1G 
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mouseTaslr2 
TCCTGGCCGCCCTGGGCTTCATCAGTACGCTGGCCATTCTGCTCATCTTCTGGAGACATT 1783
TACTGGCTGCCCTGGGCTTCTTCAGTACACTGGCCATTCTTTTCATCTTCTGGAGACATT 1783
TGCTGGCCGCCCTGGGCTTCCTCAGCACCCTGGCCATCCTGGTGATATTCTGGAGGCACT 1771
CAGCTAACACGCTATTGCTGCTGCTGCTGATTGGGACTGCTGGCCTGTTTGCCTGGCGTC 1777
CAGCTAACACGCTATTGCTGCTGCTGCTGGTTGGGACTGCTGGCCTGTTTGCCTGGCATT 1771
CAGCTAACACGCTGCTGCTGCTGCTGCTGCTTGGGACTGCTGGCCTGTTTGCCTGGCACC 1774
CAGCTAATACGTTGCTGCTGCTGCTGGTGACTGGGACTGCTGGCCTGTTTGCCTGGCACT 1774
TGCTGCTTTGCCTGGTGCTGGGTCTAGCACTGGCTGCTCTGGGGCTCTCTGTCCACCACT 1795
TGCTGCTTTGCCTGGTGCTGGGCCTGACACTGGCTGCCCTGGGGCTCTTTGTCCACTACT 1795
CGCTGCTGGCTCTGGCGCTGGGCCTGGCGCTGGCAGCCCTGGGGCTCTTCCTCTGGCACT 1789
TGCTGCTGAGCCTGGCGCTGGGCCTTGTGCTGGCTGCTTTGGGGCTGTTCGTTCACCATC 1780



mouseTaslr2 TCCAGACGCCCATGGTGCGCTCGGCGGGCGGCCCCATGTGCTTCCTGATGCTGGTGCCCC 1843
ratTaslr2 TCCAGACACCCATGGTGCGCTCGGCCGGTGGCCCCATGTGCTTCCTGATGCTCGTGCCCC 1843
humanTASlR2 TCCAGACACCCATAGTTCGCTCGGCTGGGGGCCCCATGTGCTTCCTGATGCTGACACTGC 1831
catTaslr2
mouseTaslrl TTCACACGCCTGTTGTGAGGTCAGCTGGGGGTAGGCTGTGCTTCCTCATGCTGGGTTCCT 1837
ratTaslrl TTCACACACCTGTAGTGAGGTCAGCTGGGGGTAGGCTGTGCTTCCTCATGCTGGGTTCCC 1831
humanTASlRl TAGACACCCCTGTGGTGAGGTCAGCAGGGGGCCGCCTGTGCTTTCTTATGCTGGGCTCCC 1834
catTaslrl TAGACACCCCTGTGGTGAAGTCCGCTGGGGGCCGACTGTGCTTCTTCATGCTAGGCTCCC 1834
mouseTaslr3 GGGACAGCCCTCTTGTCCAGGCCTCAGGTGGCTCACAGTTCTGCTTTGGCCTGATCTGCC 1855
ratTaslr3 GGGACAGCCCTCTTGTTCAGGCCTCAGGTGGGTCACTGTTCTGCTTTGGCCTGATCTGCC 1855
catTaslr3 CGGACAGCCCGCTGGTTCAGGCCTCAGGTGGGCCACGGGCCTGCTTTGGCCTGGCTTGCC 1849
humanTASlR3 GGGACAGCCCACTGGTTCAGGCCTCGGGGGGGCCCCTGGCCTGCTTTGGCCTGGTGTGCC 1840



mouseTaslr2 TGCTGCTGGCGTTCGGGATGGTCCCCGTGTATGTGGGCCCCCCCACGGTCTTCTCCTGTT 1903
ratTaslr2 TGCTGCTGGCGTTTGGGATGGTGCCCGTGTATGTGGGGCCCCCCACGGTCTTCTCATGCT 1903
humanTASlR2 TGCTGGTGGCATACATGGTGGTCCCGGTGTACGTGGGGCCGCCCAAGGTCTCCACCTGCC 1891
catTaslr2
mouseTaslrl TGGTAGCTGGGAGTTGCAGCCTCTACAGCTTCTTCGGGAAGCCCACGGTGCCCGCGTGCT 1897
ratTaslrl TGGTGGCCGGAAGTTGCAGCTTCTATAGCTTCTTCGGGGAGCCCACGGTGCCCGCGTGCT 1891
humanTASlRl TGGCAGCAGGTAGTGGCAGCCTCTATGGCTTCTTTGGGGAACCCACAAGGCCTGCGTGCT 1894
catTaslrl TGGCAGGGGGCAGCTGTGGGCTCTACGGCTTTTTTGGGGAGCCCACGCTGCCCACATGCT 1894
mouseTaslr3 TAGGCCTCTTCTGCCTCAGTGTCCTTCTGTTCCCAGGGCGGCCAAGCTCTGCCAGCTGCC 1915
ratTaslr3 TAGGCCTCTTCTGCCTCAGTGTCCTTCTGTTCCCAGGACGACCACGCTCTGCCAGCTGCC 1915
catTaslr3 TGGGCCTGGTCTGCCTCAGTGTCCTCCTGTTCCCTGGCCAGCCAGGCCCTGCCAGCTGCC 1909
humanTASlR3 TGGGCCTGGTCTGCCTCAGCGTCCTCCTGTTCCCTGGCCAGCCCAGCCCTGCCCGATGCC 1900



mouseTaslr2 TCTGCCGCCAGGCTTTCTTCACCGTTTGCTTCTCCGTCTGCCTCTCCTGCATCACGGTGC 1963

ratTaslr2 TCTGCCGACAGGCTTTCTTCACCGTCTGCTTCTCCATCTGCCTATCCTGCATCACCGTGC 1963

humanTASlR2 TCTGCCGCCAGGCCCTCTTTCCCCTCTGCTTCACAATTTGCATCTCCTGTATCGCCGTGC 1951 

catTaslr2 : 

mouseTaslr1 TGCTGCGTCAGCCCCTCTTTTCTCTCGGGTTTGCCATTTTCCTCTCCTGTCTGACAATCC 1957

ratTaslrl TGCTGCGTCAGCCCCTCTTTTCTCTCGGGTTTGCCATCTTCCTCTCCTGCCTGACAATCC 1951 

humanTASlRl TGCTACGCCAGGCCCTCTTTGCCCTTGGTTTCACCATCTTCCTGTCCTGCCTGACAGTTC 1954

catTaslrl TGTTGCGCCAAAGCCTCCTTGCCCTGGGTTTTGCCATCTTCCTGTCCTGCCTGACCATCC 1954 

mouseTaslr3 TTGCACAACAACCAATGGCTCACCTCCCTCTCACAGGCTGCCTGAGCACACTCTTCCTGC 1975

ratTaslr3 TTGCCCAACAACCAATGGCTCACCTCCCTCTCACAGGCTGCCTGAGCACACTCTTCCTGC 1975

catTaslr3 TGGCCCAGCAGCCACTGTTCCACCTCCCACTCACTGGCTGCCTGAGCACGTTTTTCCTGC 1969

humanTASlR3 TGGCCCAGCAGCCCTTGTCCCACCTCCCGCTCACGGGCTGCCTGAGCACACTCTTCCTGC 1960



mouseTaslr2 GCTCCTTCCAGATTGTGTGCGTCTTCAAGATGGCCAGACGCCTGCCAAGCGCCTACGGTT 2023 

ratTaslr2 GCTCCTTCCAGATCGTGTGTGTCTTCAAGATGGCCAGACGCCTGCCAAGTGCCTACAGTT 2023 

humanTASlR2 GTTCTTTCCAGATCGTCTGCGCCTTCAAGATGGCCAGCCGCTTCCCACGCGCCTACAGCT 2011 
catTaslr2 

mouseTaslrl GCTCCTTCCAACTGGTCATCATCTTCAAGTTTTCTACCAAGGTACCCACATTCTACCACA 2017 

ratTas1r1 GCTCCTTCCAACTGGTCATCATCTTCAAGTTTTCTACCAAGGTGCCCACATTCTACCGTA 2011

humanTASlRl GCTCATTCCAACTAATCATCATCTTCAAGTTTTCCACCAAGGTACCTACATTCTACCACG 2014 

catTaslrl GCTCCTTCCAACTGGTCTTCATCTTCAAGTTTTCTGCCAAGGTACCCACCTTCTACCGTG 2014 

mouseTaslr3 AAGCAGCTGAGACCTTTGTGGAGTCTGAGCTGCCACTGAGCTGGGCAAACTGGCTATGCA 2035 

ratTas1r3 AAGCAGCCGAGATCTTTGTGGAGTCTGAGCTGCCACTGAGTTGGGCAAACTGGCTCTGCA 2035

catTas1r3 AAGCGGCCGAGATATTTGTGGGGTCGGAGCTGCCACCAAGCTGGGCTGAGAAGATGCGTG 2029

humanTAS1R3 AGGCGGCCGAGATCTTCGTGGAGTCAGAACTGCCTCTGAGCTGGGCAGACCGGCTGAGTG 2020



Figure 1H 
TCTGGATGCGTTACCACGGGCCCTACGTCTTTGTGGCCTTCATCACGGCCGTCAAGGTGG 2083
TTTGGATGCGTTACCACGGGCCCTATGTCTTCGTGGCCTTCATCACGGCCATCAAGGTGG 2083 
ACTGGGTCCGCTACCAGGGGCCCTACGTCTCTATGGCATTTATCACGGTACTCAAAATGG 2071 



CTTGGGCCCAAAACCATGGTGCCGGAATATTCGTCATTGTCAGCTCCACGGTCCATTTGT 
CCTGGGCCCAAAACCATGGTGCAGGTCTATTCGTCATTGTCAGCTCCACGGTCCATTTGC 
CCTGGGTCCAAAACCACGGTGCTGGCCTGTTTGTGATGATCAGCTCAGCGGCCCAGCTGC 
CCTGGGTCCAAAACCACGGTCCTGGCCTATTTGTGGTGATCAGCTCAATGGCCCAGCTGC 
GCTACCTTCGGGGACTCTGGGCCTGGCTAGTGGTACTGTTGGCCACTTTTGTGGAGGCAG 
GCTACCTTCGGGGCCCCTGGGCTTGGCTGGTGGTACTGCTGGCCACTCTTGTGGAGGCTG 
GCCGCCTGCGGGGGCCCTGGGCCTGGCTGGTGGTGCTGCTTGCTATGCTGGCAGAAGCCG 
GCTGCCTGCGGGGGCCCTGGGCCTGGCTGGTGGTGCTGCTGGCCATGCTGGTGGAGGTCG 



2077 
2071 
2074 
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2089 
2080 
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CCCTGGTGGCAGGCAACATGCTGGCCACCACCATCAACCCCATTGGCCGGACCGACCCCG 2143 
CCCTGGTGGTGGGCAACATGCTGGCCACCACCATCAACCCCATTGGCCGGACCGACCCGG 2143 
TCATTGTGGTAATTGGCATGCTGGCCACGGGCCTCAGTCCCACCACCCGTACTGACCCCG 2131 



TCCTCTGTCTCACGTGGCTTGCAATGTGGACCCCACGGCCCACCA-- 
TCATCTGTCTCACATGGCTTGTAATGTGGACCCCACGACCCACCA-- 
TTATCTGTCTAACTTGGCTGGTGGTGTGGACCCCACTGCCTGCTA-- 
TCATCTGTCTAACTTGGCTGGCGGTGTGGACCCCACTGCCCACCA-- 
CACTATGTGCCTGGTATTTGATCGCTTTCCCACCAGAGGTGGTGA— 
CACTATGTGCCTGGTACTTGATGGCTTTCCCTCCAGAGGTGGTGA-- 
CATTGTGTGCCTGGTACCTGGTAGCCTTCCCGCCAGAGGTGGTGA-- 
CACTGTGCACCTGGTACCTGGTGGCCTTCCCGCCGGAGGTGGTGA-- 



■-GGGAGTACCAGC 
— GGGAATACCAGC 
--GGGAATACCAGC 
-GGGAGTACCAGC 
•CAGACTGGTCAGT 
•CAGATTGGCAGGT 
•CGGACTGGCGGGT 
CGGACTGGCACAT 
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ATGACCCCAATATCATAATCCTCTCCTGCCACCCTAACTACCGCAACGGGCTACTCTTCA 2203 
ATGACCCCAACATCATGATCCTCTCGTGCCACCCTAACTACCGCAACGGGCTACTGTTCA 2203 
ATGACCCCAAGATCACAATTGTCTCCTGTAACCCCAACTACCGCAACAGCCTGCTGTTCA 2191 

GCTTCCCCCATCTGGTGATTCTTGAGTGCACAGAGGTCAACTCTGTGGGCTTCCTGGTGG 2194 

GCTTCCCCCATCTGGTGATTCTCGAGTGCACAGAGGTCAACTCTGTAGGCTTCCTGTTGG 2188 

GCTTCCCCCATCTGGTGATGCTTGAGTGCACAGAGACCAACTCCCTGGGCTTCATACTGG 2191 

GCTTCCCTCAGCTGGTGGTGCTTGATTGCACAGAGGCCAACTCACCGGGCTTCATGTTGG 2191 

GCTGCCCACAGA-GGTACTGGAGCACTGCCACGTGCGTTCCTGGGTCAGCCTGGGCTTGG 2212 

GCTGCCCACGGA-GGTACTGGAACACTGCCGCATGCGTTCCTGGGTCAGCCTGGGCTTGG 2212

ACTGCCCACAGA-GGCGCTGGTGCACTGCCACGTGCACTCCTGGATCAGCTTCGGCCTGG 2206 

GCTGCCCACGGA-GGCGCTGGTGCACTGCCGCACACGCTCCTGGGTCAGCTTCGGCCTAG 2197 
ACACCAGCATGGACTTGCTGCTGTCCGTGCTGGGTTTCAGCTTCGCGTACGTGGGCAAGG 2263

ACACCAGCATGGACTTGCTGCTGTCTGTGCTGGGTTTCAGCTTCGCTTACATGGGCAAGG 2263

ACACCAGCCTGGACCTGCTGCTCTCAGTGGTGGGTTTCAGCTTCGCCTACATGGGCAAAG 2251

CTTTCGCACACAACATCCTCCTCTCCATCAGCACCTTTGTCTGCAGCTACCTGGGTAAGG 2254

CTTTCACCCACAACATTCTCCTCTCCATCAGTACCTTCGTCTGCAGCTACCTGGGTAAGG 2248

CCTTCCTCTACAATGGCCTCCTCTCCATCAGTGCCTTTGCCTGCAGCTACCTGGGTAAGG 2251

CTTTCGCCTACAATGGCCTCCTGTCCGTCAGCGCCTTTGCCTGCAGCTACCTGGGCAAGG 2251

TGCACATCACCAATGCAGTGTTAGCTTTCCTCTGCTTTCTGGGCACTTTCCTGGTACAGA 2272

TGCATGCCACTAACGCCATGCTGGCCTTCCTCTGCTTCCTGGGCACTTTCCTGGTGCAGA 2266

CGCACGCCACCAATGCCACGCTGGCCTTTCTCTGCTTCCTGGGCACTTTCCTGGTGCGGA 2257
AACTGCCCACCAACTACAACGAAGCCAAGTTCATCACCCTCAGCATGACCTTCTCCTTCA 2323
AGCTGCCCACCAACTACAACGAAGCCAAGTTCATCACTCTCAGCATGACCTTCTCCTTCA 2323
AGCTGCCCACCAACTACAACGAGGCCAAGTTCATCACCCTCAGCATGACCTTCTATTTCA 2311
AACTGCCGGAGAACTATAACGAAGCCAAATGTGTCACCTTCAGCCTGCTCCTCCACTTCG 2314
AACTGCCAGAGAACTATAATGAAGCCAAATGTGTCACCTTCAGCCTGCTCCTCAACTTCG 2308
ACTTGCCAGAGAACTACAACGAGGCCAAATGTGTCACCTTCAGCCTGCTCTTCAACTTCG 2311
ACCTGCCAGAGAACTACAACGAGGCCAAATGTGTCACTTTTAGTCTGCTGCTCAACTTCG 2311
GCCAGCCTGGCCGCTACAACCGTGCCCGTGGTCTCACCTTCGCCATGCTAGCTTATTTCA 2332
GCCAGCCTGGTCGCTATAACCGTGCCCGTGGCCTCACCTTCGCCATGCTAGCTTATTTCA 2332
GCCGGCCAGGCCGCTACAATGGTGCCCGCGGCCTCACCTTTGCCATGCTGGCCTACTTCA 2326
GCCAGCCGGGCCGCTACAACCGTGCCCGTGGCCTCACCTTTGCCATGCTGGCCTACTTCA 2317
CCTCCTCCATCTCCCTCTGCACGTTCATGTCTGTCCACGATGGCGTGCTGGTCACCATCA 2383
CCTCCTCCATCTCCCTCTGCACCTTCATGTCTGTGCACGACGGCGTGCTGGTCACCATCA 2383
CCTCATCCGTCTCCCTCTGCACCTTCATGTCTGCCTACAGCGGGGTGCTGGTCACCATCG 2371

TATCCTGGATCGCTTTCTTCACCATGTCCAGCATTTACCAGGGCAGCTACCTACCCGCGG 2374
TATCCTGGATCGCCTTCTTCACCATGGCCAGCATTTACCAGGGCAGCTACCTGCCTGCGG 2368
TGTCCTGGATCGCCTTCTTCACCACGGCCAGCGTCTACGACGGCAAGTACCTGCCTGCGG 2371
TGTCCTGGATTGCCTTCTTCACCACGGCCAGCGTCTACCAGGGCAAGTACTTGCCCGCGG 2371
TCACCTGGGTCTCTTTTGTGCCCCTCCTGGCCAATGTGCAGGTGGCCTACCAGCCAGCTG 2392
TCATCTGGGTCTCTTTTGTGCCCCTCCTGGCTAATGTGCAGGTGGCCTACCAGCCAGCTG 2392
TCACCTGGATCTCCTTTGTGCCCCTCTTTGCCAATGTGCACGTGGCCTACCAGCCTGCCG 2386
TCACCTGGGTCTCCTTTGTGCCCCTCCTGGCCAATGTGCAGGTGGTCCTCAGGCCCGCCG 2377



TGGATCTCCTGGTCACTGTGCTCAACTTTCTGGCCATCGGCTTGGGGTACTTTGGCCCCA 2443 
TGGACCTCCTGGTCACTGTGCTCAACTTCCTGGCCATCGGCTTGGGATACTTTGGCCCCA 2443
TGGACCTCTTGGTCACTGTGCTCAACCTCCTGGCCATCAGCCTGGGCTACTTCGGCCCCA 2431 

TCAATGTGCTGGCAGGGCTGGCCACTCTGAGTGGCGGCTTCAGCGGCTATTTCCTCCCTA 2434
TCAATGTGCTGGCAGGGCTGACCACACTGAGCGGCGGCTTCAGCGGTTACTTCCTCCCCA 2428 
CCAACATGATGGCTGGGCTGAGCAGCCTGAGCAGCGGCTTCGGTGGGTATTTTCTGCCTA 2431 
TCAACGTGCTGGCGGCGCTGAGCAGCCTGAGTGGCGGCTTCAGCGGTTATTTCCTCCCCA 2431
TGCAGATGGGTGCTATCCTAGTCTGTGCCCTGGGCATCCTGGTCACCTTCCACCTGCCCA 2452 
TGCAGATGGGTGCTATCTTATTCTGTGCCCTGGGCATCCTGGCCACCTTCCACCTGCCCA 2452
TGCAGATGGGCACCATCCTCCTCTGTGCCCTGGGTATCCTAGCCACCTTCCACCTGCCCA 2446
TGCAGATGGGCGCCCTCCTGCTCTGTGTCCTGGGCATCCTGGCTGCCTTCCACCTGCCCA 2437 



AGTGTTACATGATCCTTTTCTACCCGGAGCGCAACACTTCAGCTTATTTCAATAGCATGA 2503 
AGTGTTACATGATCCTTTTCTACCCGGAGCGCAACACCTCAGCCTATTTCAATAGCATGA 2503 
AGTGCTACATGATCCTCTTCTACCCGGAGCGCAACACGCCCGCCTACTTCAACAGCATGA 2491

AATGCTACGTGATTCTCTGCCGTCCAGAACTCAACAACACAGAACACTTTCAGGCCTCCA 2494 
AGTGCTATGTGATTCTCTGCCGTCCAGAACTCAACAATACAGAACACTTTCAGGCCTCCA 2488
AGTGCTACGTGATCCTCTGCCGCCCAGACCTCAACAGCACAGAGCACTTCCAGGCCTCCA 2491
AGTGCTACGTGATCCTGTGCCGCCCAAAATTTAACAGCACACAGCACTTCCAGGCCTCCA 2491
AGTGCTATGTGCTTCTTTGGCTGCCAAAGCTCAACACCCAGGAGTTCTTCCTGGGAAGGA 2512
AATGCTATGTACTTCTGTGGCTGCCAGAGCTCAACACCCAGGAGTTCTTCCTGGGAAGGA 2512 
AGTGCTACCTGCTGCTGCAGCGGCCGGAGCTCAACACCCCTGAGTTCTTCCTGGAAGACA 2506 
GGTGTTACCTGCTCATGCGGCAGCCAGGGCTCAACACCCCCGAGTTCTTCCTGGGAGGGG 2497



TTCAGGGCTACACGATGAGGAAGAGCTAG 2532 

TCCAGGGCTACACCATGAGGAAGAGC 2529

TCCAGGGCTACACCATGAGGAGGGACTAG 2520 

TCCAGGACTACACGAGGCGCTGCGGCACTACCTGA 2529 

TCCAGGACTACACGAGGCGCTGCGGCACTACC — — 2520

TTCAGGACTACACGAGGCGCTGCGGCTCCACCTGA 2526

TCCAGGAGTACACGAGGCGCTGCGGCTCCACCTGA — — 2526

ATGCCAAGAAAGCAGCAGATGAGAAC-AGTGGCGGTGGTGAGGCAGCTCAGGGACACAAT 2571
GCCCCAAGGAAGCATCAGATGGGAAT-AGTGGTAGTAGTGAGGCAACTCGGGGACACAGT 2571 

ATGCC A GAGC ACAGGGCAGCAGTTGGGGGCAGGGGAGGGGAGAATCGGGGCAAAAAC 2563

GCCCTGGGGATGCCCAAGGCCAGAAT-GACGGGAACACAGGAAATCAGGGGAAACAT 2553



GAATGA 2577 
GAATGA 2577 
AAGTGA 2569 
GAGTGA 2559 
Figure 2A CLUSTAL W (1.82) multiple amino acid sequence alignment of T1Rs:

mouseTlR2 MGPQARTLHLLFLLLHALPKPVML VGNSDFHLAGDYLLGGLFTLHANVKSVSHLSYL 57

ratTlR2 MGPQARTLCLLSLLLHVLPKPGKL VENSDFHLAGDYLLGGLFTLHANVKSISHLSYL 57

humanTlR2 MGPRAKTICSLFFLLWVLAEP AENSDFYLPGDYLLGGLFSLHANMKGIVHLNFL 54 

catTlR2 MGPRAREVCCFITLPRLLAEP AENSDFYLAGDYFLGGLFTLHANVKGIVHLNLL 54 

mouseTlRl MLFWAAHLLLSLQLAVAYCWAFSCQRTESSPGFSLPGDFLLAGLFSLHADCLQVRHRPLV 60 

ratTlRl MLFWAAHLLLSLQL — VYCWAFSCQRTESSPGFSLPGDFLLAGLFSLHGDCLQVRHRPLV 58 

humanTlRl MLLCTARLVG-LQLLISCCWAFACHSTESSPDFTLPGDYLLAGLFPLHSGCLQVRHRPEV 59 

catTlRl MSLPAAHLVG-LQLSLSCCWALSCHSTETSADFSLPGDYLLAGLFPLHSDCPGVRHRPTV 59

mouseTlR3 MPALAIMGLS LAAFLELGMGASLCLSQQFKAQGDYILGGLFPLG-STEEATLNQRT 55

ratTlR3 MPGLAILGLS LAAFLELGMGSSLCLSQQFKAQGDYILGGLFPLG-TTEEATLNQRT 55 

humanTlR3 MLGPAVLGLS LWALLHPGTGAPLCLSQQLRMKGDYVLGGLFPLG-EAEEAGLRSRT 55 

catTlR3 MPGLALLGLTALLGLTALLDHGEGATSCLSQQLRMQGDYVLGGLFPLG-SAEGTGLGDGL 59 

• : **: .*.***.* 

mouseTlR2 QVPKCNEYNMKVLGYNLMQAMRFAVEEINNCSSLLPGVLLGYEMVDVCYL-SNNIQPGLY 116 

ratTlR2 QVPKCNEFTMKVLGYNLMQAMRFAVEEINNCSSLLPGVLLGYEMVDVCYL-SNNIHPGLY 116 

humanTlR2 QVPMCKEYEVKVIGYNLMQAMRFAVEEINNDSSLLPGVLLGYEIVDVCYI-SNNVQPVLY 113 

catTlR2 QVPQCKEYEIKVLGYDLMQAMCFAGEEINSQSSLLPGVLLGYKMVDVSYI-SNNVQPVLH 113

mouseTlRl TSCDR-SDSFNGHGYHLFQAMRFTVEEINNSTALLPNITLGYELYDVCSE-SSNVYATLR 118 

ratTlRl TSCDR-PDSFNGHGYHLFQAMRFTVEEINNSSALLPNITLGYELYDVCSE-SANVYATLR 116

humanTlRl TLCDR-SCSFNEHGYHLFQAMRLGVEEINNSTALLPNITLGYQLYDVCSD-SANVYATLR 117

catTlRl TLCDR-PDSFNGHGYHLFQAMRFGIEEINNSTALLPNVTLGYQLYDVCSE-SANVYATLN 117

mouseTlR3 QPNSIPCNRFSPLGLFLAMAMKMAVEEINNGSALLPGLRLGYDLFDTCSEPVVTMKSSLM 115 

ratTlR3 QPNGILCTRFSPLGLFLAMAMKMAVEEINNGSALLPGLRLGYDLFDTCSEPVVTMKPSLM 115

humanTlR3 RPSSPVCTRFSSNGLLWALAMKMAVEEINNKSDLLPGLRLGYDLFDTCSEPVVAMKPSLM 115 

catTlR3 QPNATVCTRFSSLGLLWALAVKMAVEEINNGSALLPGLHLGYDLFDTCSEPMVAMKPSLV 119 

* *: : **** , :***.:***.: *.. : . * 



FLSQID-DFLPILKDYSQYRPQVVAVIGPDNSESAITVSNILSYFLVPQVTYSAITDKLR 175
FLAQDD-DLLPILKDYSQYMPHVVAVIGPDNSESAITVSNILSHFLIPQITYSAISDKLR 175
FLAHED-NLLPIQEDYSNYISRVVAVIGPDNSESVMTVANFLSLFLLPQITYSAISDELR 172
FPAKED-CSLPIQEDYSHCVPRVVAVIGPGNSESTVTVARFLSLFLLPQITYSAISDELR 172
VLAQQGTGHLEMQRDLRNHSSKVVALIGPDNTDHAVTTAALLSPFLMPLVSYEASSVILS 178
VLALQGPRHIEIQKDLRNHSSKVVAFIGPDNTDHAVTTAALLGPFLMPLVSYEASSVVLS 176
VLSLPGQHHIELQGDLLHYSPTVLAVIGPDSTNRAATTAALLSPFLVPMISYAASSETLS 177
VLSLLGTHHVEIRADPSHYSPAALAVIGPDTTNHAATTAALLSPFLVPLISYEASSVTLG 177
FLAKVGSQSIAAYCNYTQYQPRVLAVIGPHSSELALITGKFFSFFLMPQVSYSASMDRLS 175
FMAKVGSQSIAAYCNYTQYQPRVLAVIGPHSSELALITGKFFSFFLMPQVSYSASMDRLS 175
FLAKAGSRDIAAYCNYTQYQPRVLAVIGPHSSELAMVTGKFFSFFLMPQVSYGASMELLS 175
FMAKAGSCSIAAYCNYTQYQPRVLAVIGPHSSELALVTGKFFSFFLVPQVSYGASTDRLS 179



DKRRFPAMLRTVPSATHHIEAMVQLMVHFQWNWIVVLVSDDDYGRENSHLLSQRLTNTGD 235 

DKRHFPSMLRTVPSATHHIEAMVQLMVHFQWNWIVVLVSDDDYGRENSHLLSQRLTKTSD 235 

DKVRFPALLRTTPSADHHVEAMVQLMLHFRWNWIIVLVSSDTYGRDNGQLLGERVARR-D 231

DKQRFPALLPTAPGADHQIEAMVQLMLYFRRNWIIALVSSGDCGRDDSQLLSDRPAGG-D 231 

GKRKFPSFLRTIPSDKYQVEVIVRLLQSFGWVWISLVGSYGDYGQLGVQALEELATPR-G 237

AKRKFPSFLRTVPSDRHQVEVMVQLLQSFGWVWISLIGSYGDYGQLGVQALEELAVPR-G 235 

VKRQYPSFLRTIPNDKYQVETMVLLLQKFGWTWISLVGSSDDYGQLGVQALENQATGQ-G 236 

VKRHYPSFLRTIPSDKHQVEAMVLLLQSFGWVWISVVGSDGDYGQLGVQALEEQATQQ-G 236

DRETFPSFFRTVPSDRVQLQAVVTLLQNFSWNWVAALGSDDDYGREGLSIFSSLANAR-G 234

DRETFPSFFRTVPSDRVQLQAVVTLLQNFSWNWVAALGSDDDYGREGLSIFSGLANSR-G 234

ARETFPSFFRTVPSDRVQLTAAAELLQEFGWNWVAALGSDDEYGRQGLSIFSALAAAR-G 234 

NREIFPSFFRTVPSDQVQVAAMVELLEELGWNWVAAVGSDDEYGRQGLSLFSGLASAR-G 238 
. ;*:::**. ;:.,*: : * : : ■* *:. : . 
mouseTlR2 ICIAFQEVLPVPEPNQAVRPEEQDQLDNILDKLRR-TSARVVVIFSPELSLHNFFREVLR 294

ratTlR2 ICIAFQEVLPIPESSQVMRSEEQRQLDNILDKLRR-TSARVVVVFSPELSLYSFFHEVLR 294

humanTlR2 ICIAFQETLPTLQPNQNMTSEERQRLVTIVDKLQQ-STARVVVVFSPDLTLYHFFNEVLR 290 

catTlR2 TCIAFRETLPMPQPNQAVTQWERRRLKAIVDEQQRQSSARVVVLLSPKLVLHNFFREVLR 291 

mouseTlRl ICVAFKDVVPLS AQAGDPRMQRMMLRLAR-ARTTVVVVFSNRHLAGVFFRSVVL 290

ratTlRl ICVAFKDIVPFS ARVGDPRMQSMMQHLAQ-ARTTVVVVFSNRHLARVFFRSVVL 288

humanTlRl ICIAFKDIMPFS AQVGDERMQCLMRHLAQ-AGATVVVVFSSRQLARVFFESVVL 289

catTlRl ICVAFKDIIPFS ARPGDERMQSIMHHLAR-ARTTVVVVFSSRQLARVFFESVVL 289

mouseTlR3 ICIAHEGLVPQHD TSGQQLGKVLDVLRQVNQ-SKVQVVVLFASARAVYSLFSYSIH 289

ratTlR3 ICIAHEGLVPQHD TSGQQLGKVVDVLRQVNQ-SKVQVVVLFASARAVYSLFSYSIL 289

humanTlR3 ICIAHEGLVPLPR ADDSRLGKVQDVLHQVNQ-SSVQVVLLFASVHAAHALFNYSIS 289

catTlR3 ICIAHEGLVPLP PGSLRLGALQGLLRQVNQ-SSVQVVVLFSSAHAARTLFSYSIR 292 

: * : ::. ::.**:::: :* : 

mouseTlR2 WNFTGFVWIASESWAIDPVLHNLTELRHTGTFLGVTIQRVSIPGFSQFRVRHDKPEYPMP 354 

ratTlR2 WNFTGFVWIASESWAIDPVLHNLTELRHTGTFLGVTIQRVSIPGFSQFRVRRDKPGYPVP 354 

humanTlR2 QNFTGAVWIASESWAIDPVLHNLTELGHLGTFLGITIQSVPIPGFSEFREWGPQAGPPPL 350

catTlR2 QNLTGVVRIASESWAIDPVLHDRPTRCTASWAAPRPAAPGRLSLAGEAPPTESRGHTRRR 351

mouseTlRl ANLTGKVWIASEDWAISTYITNVPGIQGIGTVLGVAIQQRQVPGLKEFEESYVQAVMGAP 350 

ratTlRl ANLTGKVWVASEDWAISTYITSVTGIQGIGTVLGVAVQQRQVPGLKEFEESYVRAVTAAP 348 

humanTlRl TNLTGKVWVASEAWALSRHITGVPGIQRIGMVLGVAIQKRAVPGLKAFEEAYARADKKAP 349

catTlRl ANLTAKVWIASEDWAISRHISNVPGIQGIGTVLGVAIQQRLVPGLKEFEEAYVQADKGAP 349

mouseTlR3 HGLSPKVWVASESWLTSDLVMTLPNIARVGTVLGFLQRGALLPEFSHYVETHLALAADPA 349

ratTlR3 HDLSPKVWVASESWLTSDLVMTLPNIARVGTVLGFLQRGALLPEFSHYVETRLALAADPT 349 

humanTlR3 SRLSPKVWVASEAWLTSDLVMGLPGMAQMGTVLGFLQRGAQLHEFPQYVKTHLALATDPA 349

catTlR3 CKLSPKVWVASEAWLTSDLVMTLPGMPGVGTVLGFLQQGAPMPEFPSYVRTRLALAADPA 352 
- . *-★*** . 

mouseTlR2 NETSLRTTC — NQDCDACMNITESFNNVLMLSG ERVVYSVYSAVYAVA 400

ratTlR2 NTTNLRTTC — NQDCDACLNTTKSFNNILILSG ERVVYSVYSAVYAVA 400 

humanTlR2 SRTSQSYTC — NQECDNCLNATLSFNTILRLSG ERVVYSVYSAVYAVA 396

catTlR2 RHSPEWLPWRPLPCSSVPLSGRVLGKLAGEARGRTLSPDT 391 

mouseTlRl RTCPEGSWCGTNQLCRECHAFTTWNMPELGAFS MSAAYNVYEAVYAVA 398 

ratTlRl SACPEGSWCSTNQLCRECHTFTTRNMPTLGAFS MSAAYRVYEAVYAVA 396 

humanTlRl RPCHKGSWCSSNQLCRECQAFMAHTMPKLKAFS MSSAYNAYRAVYAVA 397

catTlRl GPCSRTSECSSNQLCRECRAFTAEQMPTLGAFS MSSAYNAYRAVYAVA 397

mouseTlR3 FCASLN-AELDLEEHVMGQRCPRCDDIMLQNLSSGLLQNLSAGQLHHQIFATYAAVYSVA 408

ratTlR3 FCASLK-AELDLEERVMGPRCSQCDYIMLQNLSSGLMQNLSAGQLHHQIFATYAAVYSVA 408

humanTlR3 FCSALGEREQGLEEDVVGQRCPQCDCITLQNVS AGLNHHQTFSVYAAVYSVA 401 

catTlR3 FCASLDAEQPGLEEHVVGPRCPQCDHVTLENLS AGLLHHQTFAAYAAVYGVA 404

mouseTlR2 HTLHRLLHCNQVRCTK-QIVYPWQLLREIWHVNFTLLGNQLFFDEQGDMPMLLDIIQWQW 459

ratTlR2 HALHRLLGCNRVRCTK-QKVYPWQLLREIWHVNFTLLGNRLFFDQQGDMPMLLDIIQWQW 459

humanTlR2 HALHSLLGCDKSTCTK-RVVYPWQLLEEIWKVNFTLLDHQIFFDPQGDVALHLEIVQWQW 455

catTlR2 391

mouseTlRl HGLHQLLGCTSGTCAR-GPVYPWQLLQQIYKVNFLLHKKTVAFDDKGDPLGYYDIIAWDW 457

ratTlRl HGLHQLLGCTSEICSR-GPVYPWQLLQQIYKVNFLLHENTVAFDDNGDTLGYYDIIAWDW 455

humanTlRl HGLHQLLGCASGACSR-GRVYPWQLLEQIHKVHFLLHKDTVAFNDNRDPLSSYNIIAWDW 456

catTlRl HGLHQLLGCASGACSR-DRVYPWQLLEQIRKVNFLLHKDTVRFNDNGDPLSGYDIIAWDW 456

mouseTlR3 QALHNTLQCNVSHCHVSEHVLPWQLLENMYNMSFHARDLTLQFDAEGNVDMEYDLKMWVW 468

ratTlR3 QALHNTLQCNVSHCHTSEPVQPWQLLENMYNMSFRARDLTLQFDAKGSVDMEYDLKMWVW 468

humanTlR3 QALHNTLQCNASGCPAQDPVKPWQLLENMYNLTFHVGGLPLRFDSSGNVDMEYDLKLWVW 461

catTlR3 QALHNTLRCNASGCPRREPVRPWQLLENMYNVSFRARGLALQFDASGNVNVDYDLKLWVW 464



mouseTlR2 GLSQNPFQSIASYSPTETRLTY-ISNVSWYTPNNTVPISMCSKSCQPGQMKKPIGLHPCC 518 

ratTlR2 DLSQNPFQSIASYSPTSKRLTY-INNVSWYTPNNTVPVSMCSKSCQPGQMKKSVGLHPCC 518 

humanTlR2 DRSQNPFQSVASYYPLQRQLKN-IQDISWHTVNNTIPMSMCSKRCQSGQKKKPVGIHVCC 514 

catTlR2

mouseTlRl NGPEWTFEVIGSASLSPVHLDINKTKIQWHGKNNQVPVSVCTRDCLEGHHRLVMGSHHCC 517 

ratTlRl NGPEWTFEIIGSASLSPVHLDINKTKIQWHGKNNQVPVSVCTTDCLAGHHRVVVGSHHCC 515 

humanTlRl NGPKWTFTVLGSSTWSPVQLNINETKIQWHGKDNQVPKSVCSSDCLEGHQRVVTGFHHCC 516 

catTlRl SGPKWNFRVIGSSMWPPVQLDINKTKIRWHGKDNQVPKSVCSSDCLEGHQRVISGFYHCC 516

mouseTlR3 QSPTPVLHTVGTFNG TLQLQQSKMYWP — GNQVPVSQCSRQCKDGQVRRVKGFHSCC 523

ratTlR3 QSPTPVLHTVGTFNG TLQLQHSKMYWP — GNQVPVSQCSRQCKDGQVRRVKGFHSCC 523 

humanTlR3 QGSVPRLHDVGRFNG SLRTERLKIRWHTSDNQKPVSRCSRQCQEGQVRRVKGFHSCC 518 

catTlR3 QDPTPELRTVGTFKG RLELWRSQMCWHTPGKQQPVSQCSRQCKEGQVRRVKGFHSCC 521 
Figure 2C
FECVDCPPGTYLNRSVDEFNCLSCPGSMWSYKNNIACFKRRLAFLEWHEVPTIVVTILAA 578
FECLDCMPGTYLNRSADEFNCLSCPGSMWSYKNDITCFQRRPTFLEWHEVPTIVVAILAA 578
FECIDCLPGTFLNHTEDEYECQACPNNEWSYQSETSCFKRQLVFLEWHEAPTIAVALLAA 574
FECMPCEAGTFLNTS-ELHTCQPCGTEEWAPEGSSACFSRTVEFLGWHEPISLVLLAANT 576
FECVPCEAGTFLNMS-ELHICQPCGTEEWAPKESTTCFPRTVEFLAWHEPISLVLIAANT 574
FECVPCGAGTFLNKS-DLYRCQPCGKEEWAPEGSQTCFPRTVVFLALREHTSWVLLAANT 575
FECVPCEAGSFLNKS-DLHSCQPCGKEKWAPAGSETCFPRTVVFLTWHETISWVLLAANT 575
YDCVDCKAGSYRKHP-DDFTCTPCNQDQWSPEKSTACLPRRPKFLAWGEPVVLSLLLLLC 582
YDCVDCKAGSYRKHP-DDFTCTPCGKDQWSPEKSTTCLPRRPKFLAWGEPAVLSLLLLLC 582
YDCVDCEAGSYRQNP-DDIACTFCGQDEWSPERSTRCFRRRSRFLAWGEPAVLLLLLLLS 577
YNCVDCKAGSYQRNP-DDLLCTQCDQDQWSPDRSTRCFARKPMFLAWGEPAVLLLLALLA 580



mouseT1R2  LGFISTLAILLIFWRHFQTPMVRSAGGPMCFLMLVPLLLAFGMVPVYVGPPTVFSCFCRQ 638
ratT1R2    LGFFSTLAILFIFWRHFQTPMVRSAGGPMCFLMLVPLLLAFGMVPVYVGPPTVFSCFCRQ 638
humanT1R2  LGFLSTLAILVIFWRHFQTPIVRSAGGPMCFLMLTLLLVAYMVVPVYVGPPKVSTCLCRQ 634
catT1R2
mouseT1R1  LLLLLLIGTAGLFAWRLHTPVVRSAGGRLCFLMLGSLVAGSCSLYSFFGKPTVPACLLRQ 636
ratT1R1    LLLLLLVGTAGLFAWHFHTPVVRSAGGRLCFLMLGSLVAGSCSFYSFFGEPTVPACLLRQ 634
humanT1R1  LLLLLLLGTAGLFAWHLDTPVVRSAGGRLCFLMLGSLAAGSGSLYGFFGEPTRPACLLRQ 635
catT1R1    LLLLLVTGTAGLFAWHLDTPVVKSAGGRLCFFMLGSLAGGSCGLYGFFGEPTLPTCLLRQ 635
mouseT1R3  LVLGLALAALGLSVHHWDSPLVQASGGSQFCFGLICLGLFCLSVLLFPGRPSSASCLAQQ 642
ratT1R3    LVLGLTLAALGLFVHYWDSPLVQASGGSLFCFGLICLGLFCLSVLLFPGRPRSASCLAQQ 642
humanT1R3  LALGLVLAALGLFVHHRDSPLVQASGGPLACFGLVCLGLVCLSVLLFPGQPSPARCLAQQ 637
catT1R3    LALGLALAALGLFLWHSDSPLVQASGGPRACFGLACLGLVCLSVLLFPGQPGPASCLAQQ 640



mouseT1R2  AFFTVCFSVCLSCITVRSFQIVCVFKMARRLPSAYGFWMRYHGPYVFVAFITAVKVALVA 698
ratT1R2    AFFTVCFSICLSCITVRSFQIVCVFKMARRLPSAYSFWMRYHGPYVFVAFITAIKVALVV 698
humanT1R2  ALFPLCFTICISCIAVRSFQIVCAFKMASRFPRAYSYWVRYQGPYVSMAFITVLKMVIVV 694
catT1R2
mouseT1R1  PLFSLGFAIFLSCLTIRSFQLVIIFKFSTKVPTFYHTWAQNHGAG-IFVIVSSTVHLFLC 695
ratT1R1    PLFSLGFAIFLSCLTIRSFQLVIIFKFSTKVPTFYRTWAQNHGAG-LFVIVSSTVHLLIC 693
humanT1R1  ALFALGFTIFLSCLTVRSFQLIIIFKFSTKVPTFYHAWVQNHGAG-LFVMISSAAQLLIC 694
catT1R1    SLLALGFAIFLSCLTIRSFQLVFIFKFSAKVPTFYRAWVQNHGPG-LFVVISSMAQLLIC 694
mouseT1R3  PMAHLPLTGCLSTLFLQAAETFVESELPLSWANWLCSYLRGLWAW-LVVLLATFVEAALC 701
ratT1R3    PMAHLPLTGCLSTLFLQAAEIFVESELPLSWANWLCSYLRGPWAW-LVVLLATLVEAALC 701
humanT1R3  PLSHLPLTGCLSTLFLQAAEIFVESELPLSWADRLSGCLRGPWAW-LVVLLAMLVEVALC 696
catT1R3    PLFHLPLTGCLSTFFLQAAEIFVGSELPPSWAEKMRGRLRGPWAW-LVVLLAMLAEAALC 699



mouseT1R2  GNMLATTINPIGRTDPDDPNIIILSCHPNYRNGLLFNTSMDLLLSVLGFSFAYVGKELPT 758
ratT1R2    GNMLATTINPIGRTDPDDPNIMILSCHPNYRNGLLFNTSMDLLLSVLGFSFAYMGKELPT 758
humanT1R2  IGMLATGLSPTTRTDPDDPKITIVSCNPNYRNSLLFNTSLDLLLSVVGFSFAYMGKELPT 754
catT1R2
mouseT1R1  LTWLAMWTPRPTREYQRFPHLVILECTEVNSVGFLVAFAHNILLSISTFVCSYLGKELPE 755
ratT1R1    LTWLVMWTPRPTREYQRFPHLVILECTEVNSVGFLLAFTHNILLSISTFVCSYLGKELPE 753
humanT1R1  LTWLVVWTPLPAREYQRFPHLVMLECTETNSLGFILAFLYNGLLSISAFACSYLGKDLPE 754
catT1R1    LTWLAVWTPLPTREYQRFPQLVVLDCTEANSPGFMLAFAYNGLLSVSAFACSYLGKDLPE 754
mouseT1R3  AWYLIAFPPEVVTDWSVLPTEVLEHCHVRSWVSLGLVHITNAMLAFLCFLGTFLVQSQPG 761
ratT1R3    AWYLMAFPPEVVTDWQVLPTEVLEHCRMRSWVSLGLVHITNAVLAFLCFLGTFLVQSQPG 761
humanT1R3  TWYLVAFPPEVVTDWHMLPTEALVHCRTRSWVSFGLAHATNATLAFLCFLGTFLVRSQPG 756
catT1R3    AWYLVAFPPEVVTDWRVLPTEALVHCHVHSWISFGLVHATNAMLAFLCFLGTFLVQSRPG 759



mouseT1R2  NYNEAKFITLSMTFSFTSSISLCTFMSVHDGVLVTIMDLLVTVLNFLAIGLGYFGPKCYM 818
ratT1R2    NYNEAKFITLSMTFSFTSSISLCTFMSVHDGVLVTIMDLLVTVLNFLAIGLGYFGPKCYM 818
humanT1R2  NYNEAKFITLSMTFYFTSSVSLCTFMSAYSGVLVTIVDLLVTVLNLLAISLGYFGPKCYM 814
catT1R2
mouseT1R1  NYNEAKCVTFSLLLHFVSWIAFFTMSSIYQGSYLPAVNVLAGLATLSGGFSGYFLPKCYV 815
ratT1R1    NYNEAKCVTFSLLLNFVSWIAFFTMASIYQGSYLPAVNVLAGLTTLSGGFSGYFLPKCYV 813
humanT1R1  NYNEAKCVTFSLLFNFVSWIAFFTTASVYDGKYLPAANMMAGLSSLSSGFGGYFLPKCYV 814
catT1R1    NYNEAKCVTFSLLLNFVSWIAFFTTASVYQGKYLPAVNVLAALSSLSGGFSGYFLPKCYV 814
mouseT1R3  RYNRARGLTFAMLAYFITWVSFVPLLANVQVAYQPAVQMGAILVCALGILVTFHLPKCYV 821
ratT1R3    RYNRARGLTFAMLAYFIIWVSFVPLLANVQVAYQPAVQMGAILFCALGILATFHLPKCYV 821
humanT1R3  RYNRARGLTFAMLAYFITWVSFVPLLANVQVVLRPAVQMGALLLCVLGILAAFHLPRCYL 816
catT1R3    RYNGARGLTFAMLAYFITWISFVPLFANVHVAYQPAVQMGTILLCALGILATFHLPKCYL 819
Figure 2D

mouseT1R2  ILFYPERNTSAYFNSMIQGYTMRKS 843
ratT1R2    ILFYPERNTSAYFNSMIQGYTMRKS 843
humanT1R2  ILFYPERNTPAYFNSMIQGYTMRRD 839
catT1R2
mouseT1R1  ILCRPELNNTEHFQASIQDYTRRCGTT 842
ratT1R1    ILCRPELNNTEHFQASIQDYTRRCGTT 840
humanT1R1  ILCRPDLNSTEHFQASIQDYTRRCGST 841
catT1R1    ILCRPKFNSTQHFQASIQEYTRRCGST 841
mouseT1R3  LLWLPKLNTQEFFLGRN--AKKAADENSGGGEAAQGHNE 858
ratT1R3    LLWLPELNTQEFFLGRS--PKEASDGNSGSSEATRGHSE 858
humanT1R3  LMRQPGLNTPEFFLGG---GPGDAQGQNDGNTGNQGKHE 852
catT1R3    LLQRPELNTPEFFLEDNARAQGSSWGQGRGESGQKQVTPDPVTSPQ 865
Figure 3

Phylogenetic Tree of T1Rs

Figure 4

Predicted conformation of the 7TM T1R3 protein sequence from cat.

Arrow points to region of possible functional amino acid substitution.

Figure 5A Predicted conformation of the 7TM T1R1 protein sequence from cat.
Figure 5B Predicted conformation of the cat T1R2 protein sequence.

Figure 6A Genomic sequences of cat T1R1 obtained from BAC sequencing

CTGGAAAAAAAGGNGAACCCAGGATGATTCACCCCAAAATTTCAGTNTCAGAAAANTGAGGACTGGNA 
GGAGGTCAACTTAAAGTCAGTTTCATTTGGTAAACTGAGGCCCAGGTAAAAAGTTCTAAAACCCACAG 
CTCCCTTCCATATTCTGTCCCCCAGAGAAGCAGTGTCCCTGCCTTCCTCTGACCCCTGCCCCTCAAGA 
CGCCTGGGCTCCCTTTCTGAGCCGGGTGAAGCCGCAGGCACCAGAGCGAGAACAGAACCCACAACCAT 
CCAGAGGGAGGGGCAGCGGCCACCACCTGGCTTGCACCTGTGCCTTCACCCTGCCCAGTTCCTGAGTA 
GGACCGCAGGCCCGGAAGGCCAAGGCAAACAGCCTGGTTCCTACGACTGGGTTCCAGCCCCACCCCTG 
GCACAGGCGTGAAGTTGGGAAGCATCTGGGCAGCCGCTGTCTATTCTATTTAAACAGCCGAGCTGGTC 
AGAGGGTGCTGGCTGGCCATGCCAGGCACAGGACGGACTGGCCAGCATGTCACTCCCGGCGGCTCACC 
TGGTCGGCCTGCAGCTCTCCCTCTCCTGCTGCTGGGCTCTCAGCTGCCACAGCACAGAGACGTCTGCC 
GACTTCAGCCTCCCTGGGGATTACCTCCTCGCAGGTCTGTTCCCTCTGCACTCTGACTGTCCGGGCGT 
GAGGCACCGGCCCACGGTGACCCTCTGTGACAGGTGAGTGAGGGGTCCCGTGCCTCTAGGACCTCTGC 
CCATCCTCTGTCCTCCTCAGTGAGGATCCTTGGGTTGTTGATTGAGTGGAGTTAGGGCCTTTTAGAGA 
GCTGAGACTCTAGAAGCTAAACCACGTGTTGCTTTACCTGTCTTCCACCCTGAGGATCACACGTTAAG 
TGTTCTTACCAGTCAAAATTGAATATGTATCAAACAAAAATAAATGGCCTTCCATGCTGAAATAACAA 
AAAACAGACACGCATGGAGAACCTACTTTGTGGGGCGCCTGGGTGGCCCAGTCGGTTAAGTGTCTGCC 
TCTTCGTTTTGGCTCAGGTCATGACCTCGGGGTTCATGAGTTCGAGCCCCGCGTCAGCTCCGTGATGA 
GCCTGGAGCCCGCTTGGAATTCCCTCCCCACCCCCACCCCCCGCTCATGCCAGCTCGAGCTCTCGCTC 
ACTCTCTCAAAATAAACTTAAGAGGGGCGCCTGGGTGGCGCAGTCAGTTAAGCGTCCGACTTCAGCCA 
GGTCACGATGAGCACATTATTTCCTGGACCTTCCATTCTCCTTTCGCTGTACAGAGCTTAACGTAAAC 
TCCCTGGCAAGACCTCCTTTCTGATTTTAGAAAGGCCAGCTTATTGGTTTGGTTCCTGTAATAGCTTA 
AAAATAGAATCCAGCTGTATCAGGAAACATTTAAAAAATGTATCAAGGAAGACCTATAACAGTAAAAA 
TATTTTTAAATCCCAGAGTGTTTTCATAAAGACACAGGATTACATTACTCAATTATTTTTAAAGGGTT 
TTTGAAAAGCCGTGTTTCACTTGCCATGGCTAATGATTATAGGCATCCGAATGAGCCTGTGGCTATGA 
CTTCAGTCTGTTCGGTGGAAATGACTCTGATGTCATAAACTGACTCGGCTTCGCTGACAGGAAAGTCG 
TACAGAAGAAAAGCTGTTCGAGCCCATATGTTGGTTGCGCTCAATGTCAGGAAGGGGCGACGTAATGT 
GTGCAGAAATGGGCAGCTGTCGAGAGTGAAGAAATTGGGAAGTTGGCACGGAAGAGGGGACCGAGTCC
GAGAAGGCTGCTGGATAAAGCAGAGCTTTTGCAGAAGAGAAGGGCCGGCTGCTGTCCCTATCCTGGTG 
GCGGAACCACTTAGAAACAAGGCGTCAGAATTAGAGACTTCGGTTCATGCAGGGAGGGCGGCCCAGGG 
GGGTGGCGTCCTTGGAAACTCTGGTAAGTTTGAGATTGATCCCAGGGGTCGTGGGATGGAGCCTCGCA 
TGAGACTCTACACTGATCGATGAGAAGCAGAAGCCCCTTGTCTGTGAGGAAGGGGACACGAGCAGTTG 
GCACACTAAAACGCAAGGACACGTTTCTACGAGAAAACGGTACATCTGTCTGCGACACAGAAAGATCC 
CCGGNACCAGTCNTCGNNNNNNNNTTCCGNTGGGATTCCAGTCAGCAGTTCCCGAGAGGCACTGAGGA 
ACACAGGCCCTCACCACGTTCACAAGTGTCCTGATGAGAGGGATACTAGGTAAACGAGGTTCGA: CAG 
GTGTGGTGGTTAATTTTATACATCAACCTGGCTAGGGTACGGTGCCCAGTTGTTTGGCCAAACACCAG 
TCTAGATGGGGCTGTGAAGGTTAACATTTAAACCAACAGGGTGAGTAAAGCAGATCGCTTTCCATTGT 
Figure 6B

GTGGGTGGGCCTCATCCAATCAGTTGAAGACCTTAAAAGAAAAGATTGAGGTCCCCCCAAAAAGGAAG 
AAATTCTGCCTTCGAACTCAACACTGCAGCTTTGACCACTGAGAGCATTTCCAGCCTGCCCTGCAAAC 
GCCAGACTCACCAGCCCCACAATCATGTGAACCAATTCCTTAAAATAAACTTCTCTTTCTCTCTCTCT 
ATCCAACTGGTTCTGTTTCTCTGCAGAACCCTGACTCACGCAGCAGGTTTCCCTGCTACAGGACTTCA 
TCAGCCTTTCAACCCTAATATGCTCATCCAGGG^GGAATGGTTTGTGGTTTCTCCAAGTTGTAACCGC 
CCCTCCCCCCCCGCCCCCGCCCCCCGAAAGGCCTGTTAACACAGCTGAGTGTATGGTACAGGGCCCAC 
AGTGAGGTCATGGTGGTAGGGGACGGGACAGATGCCCTCAGAGTTTCCTTTCTACCCTTCCCCCCACC 
CCCGACGCCAAGAGGGTCTCGGCAAGGCCTTGCTCCTCTGAGCTCTCAGCTGGGCTTTCTCTACAGGC 
CCGACAGCTTCAACGGTCACGGCTACCACCTCTTCCAGGCCATGCGGTTTGGCATCGAGGAGATAAAC
AACTCCACGGCCCTCCTGCCGAACGTCACCCTGGGATACCAGCTGTACGACGTGTGCTCGGAGTCTGC 
CAACGTGTATGCCACACTAAACGTGCTCTCCCTGCTGGGGACACATCACGTAGAGATCCGAGCAGACC 
CTTCCCACTATTCGCCTGCCGCCCTGGCTGTCATTGGGCCTGACACCACCAACCACGCAGCCACCACT 
GCAGCCCTGCTGAGCCCCTTCCTGGTGCCCCTGGTGAGCTGGAGCCCGGGGGCCTGTCCATCTCCCCT
GCCGGCAGGTCCAGTGTGGGCTGAGGGGGTGGGGGGGTGGGCAAGAGCTGCCATGCCCACTCTGAGTC 
TCCTGGGTGGTCACATTGCAGGGGGCCCTGCCCCCTTCACAGTCCCCGCCCCAGCATCCCTTCCTCCC 
CAAGTGCTGCATCCAGACCTCCCTGCCTCAATGTCCTGAGAAAAACCGTCTCCTTTGAAACTGCTGCC 
CTTTGCTCTGCCCCCTCCATTCCATCTCCTCTGTGAAGAACGGAACACCCTTTGTTTCCCACCTCACA 
CACTTGTCCACTTCTCCCCGCCCTCCTCCTTCCGGTCTTCCTTCCCTCCCTCCCAGCTCAGGCTCAGA 
GGTGTGGTCCCCCTCCCCCTCCAATGC^GTGGTCCTGGGCCTCACCCTCTCCTCTGCTC.GTAGGCCTG 
TCCTAGGCTTCCTGCTCCGCCTATAAGCTGGCTTTACCCCTCTCTGTCTTCCAGGCACCTGTGGTCTT
AGCGCTGCCCTCTCTCTGAACCTCGTTCCGTGGAAACTTGTGCACTGAGCTCTCTCTTCTTGTTTGCT
TCTCCCTCTCATCACTTGCTTCCCGGGCCCCTGCCCTGACTGCTGCACCACCACTCCTGCTCTTGTGA 
TCTCCAGGGCTTTCTAGATCTCCAGGTCCAGCAAATGCTTTTCAGCCCTTCTTTGCTTGACATGACGA 
CTTTGTGACAAATTTGACCAGTCCTTCAGTGACGCTCTTGCCTCGGCATTTATGACCTGCCACCTCCC 
TCTCACTTGTGGTACCTCCTTCTCAGTCTCCTTTGGAGAATCTCCTCCCCCCCTCTTCTGAAAAAGTG
GATGATTCCGCGAGTGCAGGACCACTCCCTTTCCCAGGCAGGTGCTGGGAGCAAACAACTTTCCCTAC 
TCTTCAAGAATCTTTCTGGCTGGTCTAAAAATAAGTTGATGTGACACAGANAAAAGGAAAAGTCAAAT 
CACGTATGTACAGGGANCTACNAAACACGAAAGGTCAAGANAGGAAAGNGAGGCTANCTGCTATCTGA 
ACTATGAACAAGGGNAGGGGTAAATTCAAGGAAAGAAGAAATCANAGAAAGAAGAGGNANGGTATAAA 
AGNTGCTGGCCATCAAAAATGGAAGGAAGAATTANAANNGATTGGAGNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTTTTTCCCGTCACCGGT'GGCCAGGGTTAAA 
TTCAGGCTGCCAAGCTGTTTTTTGGGATGACTCCAGCAGTCTCCTAGGGAGTTCTTCCTGACTCTGGT 
CTTGAGCCTTTTCTAACACATTCTTCACTGAAATCAGATACACCCCTGAAACACAAGTCTGGGCAGAT 
TACCTCTCTGCCTAGACATTTAAGGGGCTCCCCAGGGCCTGCAGATAAAGACCAAGTATCTTAGCTAT 
CTTGGTGCCAGGAGTAAGGCCTCCTGCCCTGACCAGACACGCCTACTTTTGTGCTCCTTCTTCCGGCT 
Figure 6C

TCCAACCTCCTGGGTCAGTTCTCTCACTGGGTGTAGCTTTTGTTCTCTTCCCCTTCTTCTCCCACAAA 

CCTCCCCCTGGGTTTCTGCCTCTTCTTTAGATGTAGCTGGTCGGCCTCCTAGTCCACCAGAGCTGTCC 

TTGAGAGCCAGGGCTGGGACCATGTCTCCCTCCTCCTCGGGTCCCCGCGCCCAGCACAGGGCCAGCAC 

TTGGAGGCTCTGAGTTGAGGCCAAGGCCACTGAAGTCGCTGAACTGAACCCCCCCCCCGGCCCCCCTC 

CGCAGATCAGCTACGAGGCCAGCAGCGTGACGCTCGGAGTGAAGCGGCATTACCCCTCGTTTCTGCGC 

ACCATCCCCAGCGACAAGCACCAGGTGGAGGCCATGGTGCTGCTGCTGCAGAGCTTCGGGTGGGTCTG 

GATCTCGGTGGTCGGCAGCGACGGCGACTACGGGCAGCTGGGGGTGCAGGCGCTGGAGGAGCAGGCCA 

CCCAGCAGGGCATCTGCGTTGCCTTCAAGGACATCATCCCCTTCTCTGCCCGGCCGGGCGACGAGAGG 

ATGCAGAGCATCATGCACCACCTGGCCCGAGCGAGGACCACCGTTGTGGTCGTTTTCTCCAGCAGGCA 

GCTGGCCAGGGTGTTCTTTGAGTCGGTGGTGCTGGCCAACCTGACTGCCAAGGTGTGGATCGCCTCAG 

AAGACTGGGCCATCTCTAGACACATCAGCAATGTGCCCGGGATCCAGGGCATTGGCACGGTGCTGGGT 

GTGGCCATCCAGCAGAGGCTTGTCCCTGGCCTGAAGGAGTTTGAAGAGGCCTATGTCCAGGCAGATAA 

GGGGGCCCCTGGGCCTTGCTCCAGGACCTCCGAGTGCAGCAGCAACCAGCTCTGTAGAGAGTGTCGGG 

CTTTCACGGCAGAGCAGATGCCCACGCTCGGGGCATTCTCCATGAGCTCTGCTTATAACGCCTACCGG 

GCAGTCTACGCAGTGGCCCATGGCCTCCACCAGCTCCTGGGCTGTGCCTCTGGAGCCTGTTCCAGGGA 

CCGAGTCTACCCCTGGCAGGTAAGGTAGCCCAGACCCCGGCACCCTGAAACGGGGTGCTTTCCTAAGG 

CAAACAGAGTGATCCCTCTCTGGCCAACTGAGTGCTGGGGGTGGGGGACAAAGGCCACCCATCAGAAG 

GCTAATTCCTTCTCTTGGGCTTCACTTCTCTGACCTCGGCCCCTCCCACCACCATGCTCCAGACCCAG 

GGCTAAAAATGTCTGGGAAACGGGCCTTTTTAGAAGCTTCCTCTCACTCAGGAGGCCAGTTGGGAGGG

TCGAGGGGCTTCCTTGGAAGGGAGGGGGCTCTGAATTTCCAGACAGACTGAAACCACCCAAATAGAAG 

CATTTGCTTCCTAAGeCTTCCGGGTCTGGGAGAGTTGAGGAGGAGCAGCCTGCGTCATCTGTGGCTGC 

TCCATGATCCCCGTTTATCTCAGCTTCTGGAGCAGATCCGCAAGGTGAATTTCCTCCTACACAAGGAC 

ACCGTGAGGTTTAATGACAACGGGGACCCTCTCAGTGGCTACGACATAATTGCCTGGGACTGGAGTGG 

CCCCAAGTGGAACTTCAGGGTCATTGGCTCCTCCATGTGGCCTCCAGTTCAGCTGGACATAAATAAAA 

CCAAAATCCGGTGGCACGGGAAGGACAACCAGGTAATGGAGCCATGGTCACTCACCAAGTCACCGCCT 

TACGGGCAGCCTGGAGCCTGAAGTCACTGTCGACACAGCTCACACGGAGCAGGAGGGGGCCCCGGGTG 

CCAGGCCAACGTGGCTCTATCCAGCCCTGCCAGGGAAGCCCCACAGACCGCACCCAGATGGCCGGCTG 

CAGCTGGTATACACAACCAGGGGCTGTGCCCTGGGAGTGAGCTGTGAGGGCAGATGCACGGAGACTCC 

CATTCGCCATGTGAGCATCCCTTGACTTGGGCCACTCCATGTGGTTCCAGAACACCTGTGGCTTCTTG

CAGGTGCCAAAGTCTGTGTGCTCCAGCGACTGCCTCGAAGGGCACCAGCGAGTGATTTCGGGTTTCTA

CCACTGTTGCTTTGAGTGTGTGCCCTGTGAGGCCGGGAGCTTCCTCAACAAGAGCGGTGAGTGTCCAA 

ATGAGTGGGAGAATGACTGGGCACTCCCAGGGTCTGTATGGCAGATGAGGGGATCTCCCTTGGGCCAC 

GCACGTGCAGAACCAGAGCCTTGCTCCCTCTGTTGCCAGTTGAGGTACAGGTTGTAGAATATTTGCCA 

CCAGACTGAGTTCTGATGAAGCAGAAACCAACAACCAGTTGAAATCCTCAGGTCCCCTACGTCTTTTA 

CTAGAGGGCTCCTGATGCAATCCCTGCAGATGCAATCTTATCCTAAATTCAACCTTTTTATGCGAACA 
Figure 6D

GATGTAGTTATGTTCCCTTGTCCCCTCCCATGCTGTCTGTGTGAAGTCCCTTCCGTCGCCCCTGCCAA
AGACAGCCAGCACCTTGGACAGCTTGGCCTTGATGCAGATACTATTGTATCCGCAGACAAGAAACATA 
GCATACTCCACCCAGTGATGGTGCAAGGTCAAGATCAGAGAGCAAACTCAGGTAGCTAAGGGCTCAGC 
CCAGAGCTGGACTCTGTGAGCCACGTTCTTTCCTTTTACTATCTCTGTGGGCGTGAGAACACATCTCT
TCTGTTCTCAGAGAGTCAGAGAAACCACAGAATGGCAGCACAGATAGGGGGCTTTGGGTAATGGAAGC 
GCTGGGGAGATGAAAATGCCCTTCCTTTGGGGCTGGTTGCTCCTGTTGGATCATAGCCTCACTGGCAT 
GTGGGCAGAGCTACCAGAGTAAGGCCCTCTCTAAGGATCTCTCGGTTTGCAAGCCCCTTCTGGGATCA 
TAAGCCATACAGAACCTACCCAAGGGTCTCCAGAATCTGCAATTAACACAGGCATCTGGAGGAAACAC 
TTGGCCGCGGGGCCCCACTCAGGGCTACCCCCTATCTCGCTGTGTGCAGTAGGAGCCCGGCTTCTGGG 
GTACAGCGCTCCCAGCACCTTGCAGGCCTACATGGCTTCCCTTCCTCATTCCTGCTCTGCTCATCTAG 
GCTCTCAGGAGCCCCCTCCACCTTTTTCTTCCAGACCTCCACAGCTGCCAGCCTTGTGGGAAAGAAGA 
GTGGGCACCCGCGGGAAGTGAAACCTGCTTTCCACGCACCGTGGTGTTTTTGACTTGGCACGAGACCA 
TCTCTTGGGTGCTGCTGGCAGCTAATACGTTGCTGCTGCTGCTGGTGACTGGGACTGCTGGCCTGTTT 
GCCTGGCACTTAGACACCCCTGTGGTGAAGTCCGCTGGGGGCCGACTGTGCTTCTTCATGCTAGGCTC 
CCTGGCAGGGGGCAGCTGTGGGCTCTACGGCTTTTTTGGGGAGCCCACGCTGCCCACATGCTTGTTGC 
GCCAAAGCCTCCTTGCCCTGGGTTTTGCCATCTTCCTGTCCTGCCTGACCATCCGCTCCTTCCAACTG 
GTCTTCATCTTCAAGTTTTCTGCCAAGGTACCCACCTTCTACCGTGCCTGGGTCCAAAACCACGGTCC 
TGGCCTATTTGTGGTGATCAGCTCAATGGCCCAGCTGCTCATCTGTCTAACTTGGCTGGCGGTGTGGA 
CCCCACTGCCCACCAGGGAGTACCAGCGCTTCCCTCAGCTGGTGGTGCTTGATTGCACAGAGGCCAAC 
TCACCGGGCTTCATGTTGGCTTTCGCCTACAATGGCCTCCTGTCCGTCAGCGCCTTTGCCTGCAGCTA 
CGTGGGCAAGGACCTGCCAGAGAACTACAACGAGGCCAAATGTGTCACTTTTAGTCTGCTGCTCAACT 
TCGTGTCCTGGATTGCCTTCTTCACCACGGCCAGCGTCTACCAGGGCAAGTACTTGCCCGCGGTCAAC 
GTGCTGGCGGCGCTGAGCAGCCTGAGTGGCGGCTTCAGCGGTTATTTCCTCCCCAAGTGCTACGTGAT 
CCTGTGCCGCCCAAAATTTAACAGCACACAGCACTTCCAGGCCTCCATCCAGGAGTACACGAGGCGCT
GCGGCTCCACCTGACCAGTGGGGCGGGCAGGGCCTAGCCGGGGAGGTGGGGGGTGGGGGGTGAAGGGG 

TAGAAGGTGGGGTAGGGGCGCCTCCCCTGCCCTGAGGGTCGAAGGTCGAGCGAGGCGAGCGGGCCCCG
CGCCCTCCGGGAGGCCTTTTGGACTCCTGTCTTGGCTCGGGTAGTGTACGCTCACGGGAGTCCAGTCC 
AGGCTCCGAGCTGCCAATAAAGCGGTGAAACATGCGTCCTGGCTGCTCTAGCTGTCTGAACCGAGGGT 
GGGGCG 
Figure 7A Genomic sequences of cat T1R2 obtained from BAC sequencing

TTAGCTGCTGAAACGCTGCTTTTTAGCAAAAGGCCGTGACCTCATGATGTTATACGTCGTGGAGATTGA 
GAACCAGGTCCTAGCATCTGACTATGTGCTTTGAGTCCCCACTTTTGCTGGTTGTGCAACCCAGGGTGA 
GCTTCGTAAGCTTCTCTGTGCCTCAGTTTTCTCATCTGTGGAATGGGGCCGGTCATAGTCCCCGTTATT 
GTGATCATCGAGCAAGATGGTGAATGGCGAGCACACAGCATGATGCCTAGTTCTTACTGGAACACCTGT 
CCTGGGTCAGGGGCTGTATATAAAGTACTACCTGCCAGGATCAACTTGATCCGGTTCTATTCTGTCTCC 
TGGGTGAGTATCTGTGCCCTTTACTCCCAGATGTTGGAAATGTCAGGGGCATGAGACCTGTCCTTAACC 
GAGTGGCAGAAGGTTAAGTTTGTGTCCGAGATAGCAGGACATGCTTTCTCTACCTCCGCAGGGCGTTCT 
CCCAGACCCCCCAGGGCCCACCATGCCCTGCTAGGAAGGGATCATCCTAATTCTAGCCTCTTCTTCCGC 
CCCAGAGTTCTGAAGCTTCTCCACCTGTCCAGGTGTTTCCCCACCCCTTCAGCCACGGCAAGACCGTCA
CTATGTAAATGTCTGTGCAAATCCCCTGGTGTCAAGCTGCCAGCTCTCTGATGAGGCAGGGCCACCTCC 
GGGGACCCCTCACTTCCCAGCCATGGGACCCCGGGCCAGGGAAGTCTGCTGCTTCATCATCCTGCCGCG 
GCTCCTGGCTGAGCCGGCTGAGAACTCAGACTTCTACTTGGCTGGGGATTACTTCCTCGGCGGCCTCTT 
CACCCTCCATGCCAACGTGAAGGGCATCGTCCACCTCAACCTCCTGCAGGTGCCCCAGTGCAAGGAGTG 
AGTCGCCAATGTGGGGCTGGAAGTGGCGACGGGGGCGGAGTGGGAAGCCTGGGCTGGTCCTGTGCTCCT 
CAGGGGACCACGCCAGGACCAAGGGCTCAAAATGCTCTTCCTCATTCATTGCCAACCTCTCATCCCGCA 
TTATCCCCACCGGCCTGCAGGGAGACCCCATGCAGTTCATGTTACCAAAATCTTTGGCAATTGTATTCT 
GAAATATGGAGAGCTGGTTGTCCCGCCGTGTGTCTTAATAAATAAAGAGTTACAGGGTACTTGAGCCTG 
GAGGGGTTGTAGAGACCACCCCCACCTACTTTGTCAAGTGGGGAACTCCTACTGAGTCCGTGTCAAGTC 
CAAGTCTAGACACCGGGGGTTATGCCTTTGGAAGGCAGAAATGTGGTTTTTCGGTAGCAGGTTCTCAGA 
CTGGAGGGGAAGGTTTGCATTTCTCTAGGGCTGTGGTTAGGTGGGAAGGGGTGCTTCCAGGACCAGAAG 
GGATTTCCTCCACTCACCTTGTCCCCTGTGAGCCCTGGGGGTGGCTGCATCACTCAAGGTTGGGTGAGA 
CACCTTTGTGCAAGTGCGAAGGCTGGGATGGCGGACCCAGCGTGGGATGATGAGATAGTGACTTGCTGC 
AGAGAGGGTGAAGGCGTCCTGTGAGAGAGGGAGAGAAAAAAGTCTGTGACGTCGGGGAAGATCACATGC 
TGGCTTGAGAATGACGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
.NNNNNNNNNNNNNNNNNNNNNNNNNNNGATGTGGAGGTGATRGTGATGGCGGTGATTGTGACGGTGGTA 
TCGGTGATGGTGGTCACAGACAACGCAGTTATAGTGATGGCAGTGGTGATAGGAATAGTAGGTGGTGAT 
GGTCATTCTGGAGATGTGGCAGGTGACAACGATGAGATGAAAATGCCAGAATCTTCTGGAGTGGCTCCT 
TCTTGAGCCACTCCTCGGCTTTCCTATGGCAGGCAGAGGGGACTCCCCGGCTCTCCTGTCCCTTCCCCC 
TCTCACTCTGGACCTGCCTCTCACCCCACCCCACATGGCTCCCCCAGGTATGATATAAAGGTGTTGGGC
TACGATCTCATGCAGGCCATGTGCTTTGCAGGGGAGGAGATCAATAGCCAGAGCAGCCTGCTGCCTGGC 
GTGCTGCTGGGCTACAAAATGGTGGATGTCAGCTACATCTCCAACAATGTCCAGCCCGTGCTCCACTTC 
CCGGCAAAGGAGGACTGTTCCTTGCCCATCCAGGAGGACTACAGCCACTGTGTGCCCCGTGTGGTGGCT 
GTCATTGGTCCTGGCAACTCTGAGTCCACTGTGACTGTGGCCCGCTTCCTCTCTCTCTTCCTCCTTCCA 
CAGGGGAGGCCCCTGGGTCCTGGGGTAAGGAGCTGGGGGGCAGAGGAGTGGTTATCCAGGGGGCTCACT 
TCCCCCCACCGGTCCTGGGGGTAGGAGGAGGCAGGAAGTAGGGTCAGAATGTCAACCCCAATCCTRGGA 
AGGCAGCCCAGCCACGTGGTTAAGAGCTCAGGCTTGGAGGCAGACAGACCKGGGNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGCCTTCAGAGAGATCATCCTNTCAAGGGGGCCCTTAT 
TCCTTTNCCCCTGGGAGCCCNTCAGTNCCCACCACTTTCTGCAGCNCCCATTCGGGTCTCCGATTCCTC 
Figure 7B

CAATCCACTCACTCGCTGTGTGGCTCTGGATAAGTGACTGTCCCTCTCTGAACCTCAGCGTCCTCATCT 
GCAAAGTGGAGACATAACAGCACATCAGAAGGTCGCGAGAATAGGGGCGCCTGGGAGGCTCAGTCGGTT 
AAGCATCCGATTCTGGGTCGCGGCTCAGGTCATGATCTCCCGGTTCGTGAGTTCAAGCCCCGCATCGGG 
CTGTGTGCTGACAGCACAGANCCTGCTTGGGATTCTGTCTTCCCTTCTCTCTGCCCCTCACCTGCTTTT 
GCTCTCTCTCTCTCAAAATAAATAAATAAACTTTTTAAAAAAAAGGAAGGTAGTGAGAAAAAAGCGGGT 
GACAGAGATGGAGAGGGCTCCACGCGGTACCTGGCATGCTGCGAGCCCTCAGAACCCGTTAGCGACGGA 
AGTGACCTGTGTGCGTCGTCACCACCATCCCAGCAGGCCTTGAGGCTTCGACCCTGCCTCCCCCGCAAA 
GCTCACAGTCTCCGAGGCTCCGGGCCACGTCCCCCGGGCGTCCTGTCTGTGTCCCTCGAACCCCGCCCA 
GCCCTGCCGCACCGTGAGCTAGTCAGCGCCTGCTGGGTTCGTGACTCTCTCCGCCATTGTGCACCCTGG 
GGCTGGGGCCACACCCAGGGGCTCCGGTTAATTTAGATGCTTTCTTTCTCTGCCATCTGCTTACCCCCG 
AGCTTGGTTAGAGAGCCTGACTTTGCTGGGAGTCTCCAGAACGTCCCGGGACCTCCCAGCAACCAGCAT 
CTTTATTCTCCCTCCTTAGAACTGATGTGTGCAGTCGCTGTGCCTCTGCAGCTCAGAGCAGGGGTGGTT 
CCTGTGAACTGGGGCCAGGGGTGGTTTCCTGGAGGGGGCAAGGCACCGACTAGCCCTCGAAGAAGGAGC 
CGGGCTTGGCTGAGGTGGGACAGGGGGAGAGCATGAGGTTTTCGGCCAGCTTTCTGTGCCTGGGAACCC 
CCTCTCCCCACAACCCTGGATCCCAGAGGCCTTAACGGGCCCCAGCTGTAACAGACTCGTCTGTGTCGA
GCATTCCACAGTAGGTGTCCCCAGGCTCCCTCGGGGCCACCAAAGGACCACAACGACATTACGCGGACA 
GGGTCTCAGATTCCGATGGGTCCCCTGTTTGCTGGAACCATCTCCCTTTGGAAATTTACAGCTCTCTTT 
TCTGGCAGTAACCCCGCCCCTTGGTGCTGGGTACGAAGGGGGCACCCAGAGCGGGGCTCACCCAGCAGC 
GCTGACTGCTGCGTTGTCGGGCTAACGGGTATTAACCGCCTCCCTCGCCGCTCCCATTCTCTTAGCTGC 
TGAAACGCTGCTTTTTAGCAAAGGCCGTGACCTCATGATGTTATACGTCGTGGAGATTGAGAACCAGGT 
CCTAGCATCTGACTATGTGCTTTGAGTCCCCACTTTTGCTGGTTGTGCAACCCAGGGTGAGCTTCGTAA 
GCTTCTCTGTGCCTCAGTTTTCTCATCTGTGGAATGTGTGAGGGGGAGACCTCAGTTTCAAGCGGGGTG 
GCCAGGAGGGCCTTTCTGACAACTGGACAACGACCTGAGGGAGAGGAAGGAGTGAGGGAGCTATGTGGG
TGCCTAGAAGAGCGCTCCGGAAGAGGGGGCAGCGAATGCAGAGGCCGGCAGGAGCCTGGTGCGTTGGCT 
GAACCGGTGAGCAGCCCCGGGACCAGGCGGGACAGTAGGAGAAGATGAAGCCAGAGAGGTGAGGGCCGG 
GGTCAGTGGTGGAGCCCCTTGGGGGCCACTGAAGGACTCTGGCTGTCCTCGAGTGACATTAGGAGCTGT 
TGGGGAGTTTTGAGCTGAGGAGTAAGGTGACGGACAAGTGGTCGCAGAGGCCACCCGGCTGCCACGAAC 
AGCAGCAGAGACAGCCAAGGGGAAGGGTGGGGGGCTGTGGTGACCCCGGGAGGGTGGTGATGGTGGCCC 
GGTGAGGCCCTAGCTCACGCTGGCGGCCCTCCGCTCTCCGGCAGATCACCTACAGCGCCATCAGTGACG 
AGCTACGGGACAAGCAGCGCTTCCCGGCCCTTCTGCCCACAGCGCCGGGCGCCGATCACCAGATCGAGG 
CCATGGTGCAGCTGATGTTGTACTTCCGCCGGAACTGGATCATCGCGCTGGTGAGCAGCGGCGACTGCG 
GCCGCGACGACAGCCAGCTGCTCAGCGATCGCCCGGCCGGCGGCGACACCTGCATCGCCTTCCGGGAGA 
CGCTGCCCATGCCCCAGCCCAACCAGGCGGTGACGCAGTGGGAGCGCCGGCGCCTGAAGGCCATCGTGG 
ACGAGCAGCAGCGGCAGAGCTCTGCGCGCGTCGTGGTCCTGCTGTCGCCAAAGCTGGTCCTGCACAACT
TCTTCCGCGAGGTGCTCCGCCAGAACCTCACGGGCGTCGTGCGGATCGCCTCCGAGTCCTGGGCCATCG 
ACCCGGTCCTGCACGACAGGCCCACGCGCTGCACAGCCTCCTGGGCTGCACCCAGACCAGCAGCTCCGG 
GTCGTCTATCCCTGGCAGGTGAGGCCCCACCCACGGAGAGTCGGGGCCACACACGCAGGCGCCGCCACA 



Figure 7C 
GCCCTGAGTGGTTGCCATGGAGACCACTGCCCTGCTCTAGCGTCCCCCTCTCTGGCCGGGTCCTGGGCA
AACTGGCGGGAGAGGCCAGGGGACGTACCCTGTCCCCAGACACATAAAGCCAGAAGTGCTTCATGGTGA 
CAAAACTCCTTTTTTTACATTAATGTAATCCTCGCCATCCAAGATAGCCTGTCCCGGCAGGAGATTTGG 
GTGAAGTTTCCTGGAAGGAGGCCTGGCAGGCAGTGGGCCCCCTGGGCCCCCTGCCGTTTCTCCAGGGTG 
GCGGCCTTGGGGGAGGACTTCTGTGTTCAGCTCTCTGAGGCTCTGCTTTGGGTTTATGCATCTTCTCTC 
GTCCCAGGTCTGGACGATTCAGAGGAGTAAGGAGGCAAGGAGTCGCCTGGATTCAGACCTGGAATTTAA 
ATCTGTATTTTTCTGATCTGCGTGCACACCCGCGCGTGCACACACACACACCTAACCACGAAGTTTATG 
TAGGTAGAAGATTTTACTGAGGGGGCGCCTGGGTGGCTCAGTCGGTTAAGCGTCCGACTTCAGCCAGGT 
CACGATCTCGCGGTCTGTGAGTTCGAGCCCCGCGTCAGGCTCTGGGCTGATGGCTCNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGCACCCCGAGGGCCCGGGGGAGGGCACCTGAGCC 
CGTAAAGGGAAACAGGAGTGGCCTCTGAACCCAGGTGATAGGTCTCCGCTGGATGGCAGACGTGACTCC 
CACGGGAGCAGGAATAATGTCGACACATCGGCCGGAAGGGGAGCACTTCCTGGTGTGCAGTCATTGTGC 
TAAGCTCCCAACATTGGGAAACTCATGCGTTGCTTCAGAGCCCGGGAGACAGGGTTTTTGTTGTCCTAC 
TTTACAGAAGAGGAGACTGGAGCTCACGGGGGTTGGGCGACAGGCCCGAGGCTCAGAGCAGGTGGCAGA 
GCTGGTGCCTGAACCCAGGTGTGTCTGACTACAGAGCCGGGGCTCCCAGCCGCTGCCTCCCGGGTGACC 
ACATCTGCGGTCTCATTGCCCCCTTGTAGGGATGTGGACACCCAGTCTCGTGGGGTAGTCACTCTCCCC 
CGGATCGAGCCCGACTTCTTTTTTTTTTTTTAATTTTTTTTTCAACGTTTATTTATTTTTGGGACAGAG 
AGAGACAGAGCATGAATGGGCGAGGGGCAGAGAGAGAGGGAGACACAGAATCGGAAACAGGCTCCAGGC 
TCCGAGCCATCAGCCCAGAGCCTGATGCGGGGCTCGAACTCACGGACCGCGAGATCGTGACCTGGCTGA 
AGTCGGACACTTACCCGAATGCGCCACCCAGGGGCCCAGATCGAGCCCGACTTCTGACGCCAGCGTCGC 
TTCCTTTCCCTGTGGCCTCCCAGCTGCTTCAGGAAATCTGGAAGGTCAACTTCACCCTCCTGGGCCACC 
AGATCTTTTTTGACCAGCGAGGGGACCTACTCATGCGCCTGGAGATCATCCAGGGACGGTGGGACCTGA 
GCCAGAACCTTTCTGGAGCGTCGCCTCCTACTGCCCGGTGCTACGACGGCTGAGGGCCATCCGTGACGT 
CTCCTGGCACACGGCCAACAACACGGTCAGCTCTCGGAGGGCTGGTGGGGGGCTGGGACCTGGGTCTGG 
GCACTGGCTCGTGCAGGGGTGGCAAGGGCCCTGTGGACCTGAGATCCATTATCGAGCACTGATGTCATC 
CCTATTTGTGGGTGTCCCTCCTCCCATTGACTAAGCACTGTGGAAGTCTAGAGCTTTCTGGATCCTCAG 
GACCCAGGGGCTCAGGGGGCTGCACAAAGTGAACGTTAGGTGGACACGTGTGTGCTAAGGACTTCAATT 
CTCATGTCAACCCTAGGAAATAGAGAGTACTGTTCCTCCTGTCTTTGGGGTTGGGAAACTGGAGGCACA
GAGGGGGTCGCGTGACCCATAAAAGGCCACACAGCTTTCGCATGTCTCTATACACAGCATTCAGTCTAC 
ATCCCATCGATTAGTACTCGCGTTTTGGGGACAGTAGCTGTGCCTTCACCTGTGTCTGACATCTGTCAG
TCTGAAAGCTCCTTTGTTTTACCCTCTTAGCTTACAAGCTGTCAGAATGGCCGCGATGTGGGGAAGGTA 
GAGACTCAGCCTCGTGGGGAAGGGGGGAGGTGGGGGGACCTAAAAGTTCAAAGAGCCAGGGCACCTGGG 
TGGCTCAGTCAGTTAAGCATCCGACTCTGGATCTCAGCTCAGTCTTGATCTCAGGTCGTGAGTTTAGAC 
CCCTGTGTAGGGCTCCGTGCTGGGCGCGCAGCCTACTTAAAAATAATAAAAACAAAAGCNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGATCCCCGTGTCCATGTGTTCCAAGGACTGCCAGCCT 
GGGCAAAGGAAGAAGCCCGTGGGTATTCATCCCTGCTGCTTCGAGTGTCTCGACTGCCTTCCGGGCACC 
TTCCTCAACCAAACTGCAGATGGGACTCACAGACCCACACCCCTGCCCTGCCCTGCCCTGCCCCGCCCT 
Figure 7D

GGGGCTCCCAGGGCCCTTCATCTTTGGCAGGGTCTCTGGAGTCTCATCCAGGGGACACAGGTGTCCAAA 

GGCCAGGGACCATGTTTTGACTCCGCTTGTATCTCCCTAACCGCTGGTGTAAGAAAAATCTTCAATGCT 

GTGAGGGCGTGGGGGTGGGAGAAGGAACAGCCCTCAACCAGGCGAGGCTGTAACTGATCCCCTCTGCAC 

ACACATGTAGCTGAGGGCCCAGGGGGGTCAGGCCAGAGAATGTCCACCGGATGAACGAACGAATGAATG 

AATGAACGAACGAACAAACACACAAATGAATGAATGTCTCTGTCCGTAGAAGAAATGTTTCTGGCAGAC 

AGGGCTAGGATCTAATTTCTCTCTGTGGCCTCCCGAGTGCCTCGTGTAGTTCGGAGCATATAATGTTTG 

CTCAGTGAATGTTTATTGAGTGACATCCTTGATGAGAAGAATTGACATCTCCCCCTATAGATCATAAAC 

TCCAGGAAAGGGGGGACAATGTCATCCCTCCAGTGTTTACCACAGTTCACCGTTGGGGCCGAATTATTT 

TTTTTTCATGACTTCACAGATTAGTAACTAAGCGGTTCTGTACATCTACCGATCAGAGTACTTACGACG 

TGCCCAGCAGAGCCCAGGGCACAGGGTAGGTGCTCAACAAAAGTTTGTTTGCAATTGATCAGTAGCCGG 

AAGTCAGGGGGCTCGGTTTTATCCACGTCTGTGCTCTCCATCTCAGATGCCTATCACAGTGGGTGGCGC 

TCAAAAAGAAACTTGAATAAACGGTCGAATGTCCATCTCACCAGAGGGTACGGTCTTGGAAGGGAGGCA 

TTACGGTTGCCAGGCTCTGAGTCAAGGGGACCTTGGACCACATCCTGCCTCTGTAACTGGTTTTGTAAC 

NGCCTGGAGGAGCCTCAGATGCCACATCTGTGAAATGGGGTTGCAGTGAGGATCTGATGGGCCGGTGGA 

TACGAGGGACGCAGTGAGAGGTGCTACGACCGCAGGCATCGCCCTTGGCTCGCCCCCTCCCTACCCCTA 

CAGCCGGCCGGGTGCAGGTGCAGAGGATGTGGGTGCCGGGAAGGTGGGTGTATCTGATGGAACTGCTGT

GGGCTCTTGCAGACGAGTTTGGCTGCCGGCCCTGCCCGAGTTGCGGGTGGTCCCGGAGGAACGACGCTT 

CGTGCTTCAAGCGGCGGCTGGCCTCCCTTGAATGACGCGAGGCACCCGCCGTCGCTGTGGCCGTGCTGT 

CCATCCTGGGCTCCCTCTGCACCCTGGCCATCCTGGTGATCTTCTGGAGGCACCGCCACGCGCCCATGG 

TTCGCTCGGCCGGGGGCCCCAGGTGCTTCCCGATGCCGATGCCCCTGGTGTATAGGTGACGGTCTCCAT 

GTACATCGGGCAGCCCGCGTTTTTCATGTGCCTCGGCCACCAGACCCTCTTCACCCTCTGCTTCACCGT 

CTGTATCTCCCGTGTCACCGTGCGCTCTTTCCAGATCGTCCGCGTCTTCAACATGGCCAGGCGCCTCCC 

GCGTGCCTACGGCTACTGGGTCCGCTACCACGGGCCCTGTGTCTTCGTGGCGTCCTTCACGGTGCTCAA 

GATGGTCATCGTGGCGGGCAACGTGCTGGCCGCGACCGCCGAGCCCGCCGCCCGCCCCGACCCCGATGA 

CCCCAAGATCGCGGTTCTCGCCTGCAACTACCACAACGTGCTCCTGTTCGACACCAGCCTGGACCCGCT 

TCTGTCCGTGGCGGGCTTCGGCTTCGCCTACGTGGGCAAGGAGCTGCCCACCACCCACAACGAGGCCAA 

GTTCTTCACCTTCCGCATGACCTTCTACTTCACCTCtTCCATCTCCCTCTGTACCTTCATGTCTGTCTA 

CGAGGGGGTCCTGGTCACCATCCTGCACCTCGTGGTGGCAGTGCTCAACCTTCTGGGCGCTTTGGCC.ee 

TGGGCTACTTCGGCCCCAAGTGCTGCGTGGTCCTCTTCTACCCGGATCACAACACGCCCGTCTACTTCA 

GCAGCATGATTCAGGGCTACACCACCGGGAAGGACTAGCACTGCCCCCTGGCTGCCCAGGGGGCCAGAG 

GGCTCGGTACTGGGAGATGGAGACCAGGGGTGGGGCTGGGGGTGGTGGTGACTCATTCAGCCCCTGCTG 

GGAGCAGGGACACCACCCCGCCCTACTCTCTGATTTGGCCTCCCCCTCCAGGTTCTCTGCACCCTGGCC 

GTTTTTACCCACCCGCTGGTGGATGCCTAAAAATACGCTTTCCCTGCAGCCGTTTGGCTTGCCAGGCAC 

TGCCACCCATGCTAGGGAAAGGAGCCGGGGTGACCTCCCTATGGGTCTCCAAGACAGAGATGGAGCGAA

GCAGCCCACAGTCGCCATCTGGTGGTCACAGCGGGTGTCCGCAGGTTCCGGCTCCGGGCAGCCATGCTG 

GAAGGCTGGGCTGGGGCTGGTGTTGGGGGACATCTGCCCGGCATCATTCACTCCCTGCCCACGTGTCTG 

CGCCTCACCTCCCAGACTCCCCCGCCCCCCAGCTTGGGACCCAGCTTGGGACCCAGCTTCTCTGAGTCA 
Figure 7E



TGGCTGCGCATAGGGGCTGCTTCATAAATGCTTATGAATAAACCTCCCTTGGGTGAAACGAAGGCGTTT 
CCTTCTTGTTTCCAGAGGTTTCCCCCCTCCCCCCCCCGTCGCCCCAAGAAAGAAGACTGGGATCAGAGA 
CCTCAGCTTCCATTTCCGCGTTGCCACTTCTGANCCGTGTACTTTGGGCCAATTCTATTTACTGTTTCG
GANCCTACACGGNCCCTTTCCTNAAATAGGAACAATAAACCAGGGGCACCTTTGACNCACTGTGTAGTA 
NCCAATTTGACGATAANTTTTTTTAAAAGATTAAATTAATCNGATAAATT
SEQUENCE LISTING

<110> Li, Xia
      Li, Weihua
      Reed, Danielle R.
      Bachmanov, Alexander A.
      Brand, Joseph G.

<120> TASTE RECEPTOR OF THE T1R FAMILY FROM DOMESTIC CAT

<130> MON-0298

<160> 98

<170> PatentIn version 3.2

<210> 1
<211> 2569
<212> DNA
<213> Felis catus

<400> 1
atgcccggcc tcgctctcct gggcctcacg gctctcctgg gcctcacggc tctcttggac       60
cacggggagg gcgcaacgtc ctgcttgtca cagcagctca ggatgcaggg ggactatgtg      120
ctgggtgggc tcttccctct gggctctgcc gagggtacag gtcttggcga cgggctgcag      180
cccaatgcca ccgtgtgcac caggttctcg tctctgggcc tgctctgggc gctggccgtg      240
aagatggcgg tggaggagat caacaacggg tcggccctgc tgcccgggct gcacctgggc      300
tatgacctct ttgacacgtg ttcagagccc atggtggcca tgaagcccag cctcgtgttc      360
atggccaaag caggcagctg cagcattgcc gcctactgca attacacaca gtaccagccc      420
cgcgtgctgg ccgtcatcgg gccccactcg tctgagctcg ccctcgtcac cggcaagttc      480
ttcagcttct tccttgtgcc tcaggtcagc tacggcgcca gcaccgaccg gctgagcaac      540
cgggagatct tcccgtcctt cttccgcacg gtgcccagcg accaggtgca ggtggcggcc      600
atggtggagc tgctggagga gctcggctgg aactgggtgg cggcggtggg tagtgacgac      660
gagtatggcc ggcagggcct gagcctcttc tccggcctgg ccagcgccag gggcatctgc      720
atcgcgcatg agggcctggt gccactgccg ccaggcagcc tgcggctggg cgccctacag      780
ggcctgctgc gccaggtgaa ccagagcagc gtgcaggtgg tggtgctgtt ctcctccgcc      840
cacgcggccc gcaccctctt cagctacagc atccgctgca agctctcacc caaggtgtgg      900
gtggccagcg aggcctggct gacctcagac ctggtcatga cgctgcccgg catgcctggg      960
gtgggcaccg tgctgggctt cctgcagcag ggcgccccga tgccggagtt cccatcctac     1020
gtgcggaccc gcctggccct ggccgctgac cctgccttct gcgcctcgct ggacgctgaa     1080
cagccaggcc tggaggagca cgtggtgggg ccacgctgcc cccaatgtga ccacgtcacg     1140
ctagagaacc tatctgcggg gctgctgcac caccagacct tcgctgccta cgcggctgtg     1200
tatggcgtgg cccaagccct tcacaacaca ctgcgctgca atgcctcggg ctgccccagg     1260
cgggagcctg tgcggccctg gcagctccta gagaacatgt acaacgtgag cttccgtgct     1320
cgcggcctgg cactgcagtt cgacgccagc gggaacgtga acgtggatta cgacctgaaa     1380
ctgtgggtgt ggcaggaccc gacgcccgag ctgcgcaccg taggcacctt caagggccgc     1440
ctggagctct ggcgctctca gatgtgctgg cacacgccgg ggaagcagca gcccgtgtcc     1500
cagtgctccc ggcagtgcaa ggaaggccag gtgcgccgcg tgaagggctt ccactcttgc     1560
tgttacaact gcgtggactg caaggcgggc agttatcagc gcaacccaga tgacctcctc     1620
tgcacccagt gtgaccagga ccagtggtcc ccagaccgga gcacacgctg cttcgcccgc     1680
aagcccatgt tcctggcatg gggggagcca gctgtgctgc tactgctcgc gctgctggct     1740
ctggcgctgg gcctggcgct ggcagccctg gggctcttcc tctggcactc ggacagcccg     1800
ctggttcagg cctcaggtgg gccacgggcc tgctttggcc tggcttgcct gggcctggtc     1860
tgcctcagtg tcctcctgtt ccctggccag ccaggccctg ccagctgcct ggcccagcag     1920
ccactgttcc acctcccact cactggctgc ctgagcacgt ttttcctgca agcggccgag     1980
atatttgtgg ggtcggagct gccaccaagc tgggctgaga agatgcgtgg ccgcctgcgg     2040
gggccctggg cctggctggt ggtgctgctt gctatgctgg cagaagccgc attgtgtgcc     2100
tggtacctgg tagccttccc gccagaggtg gtgacggact ggcgggtact gcccacagag     2160
gcgctggtgc actgccacgt gcactcctgg atcagcttcg gcctggtgca tgccactaac     2220
gccatgctgg ccttcctctg cttcctgggc actttcctgg tgcagagccg gccaggccgc     2280
tacaatggtg cccgcggcct cacctttgcc atgctggcct acttcatcac ctggatctcc     2340
tttgtgcccc tctttgccaa tgtgcacgtg gcctaccagc ctgccgtgca gatgggcacc     2400
atcctcctct gtgccctggg tatcctagcc accttccacc tgcccaagtg ctacctgctg     2460
ctgcagcggc cggagctcaa cacccctgag ttcttcctgg aagacaatgc cagagcacag     2520
ggcagcagtt gggggcaggg gaggggagaa tcggggcaaa aacaagtga                 2569



<210> 2 

<211> 865 

<212> PRT 

<213> Felis catus 

<400> 2 

Met Pro Gly Leu Ala Leu Leu Gly Leu Thr Ala Leu Leu Gly Leu Thr
1               5                   10                  15

Ala Leu Leu Asp His Gly Glu Gly Ala Thr Ser Cys Leu Ser Gln Gln
            20                  25                  30

Leu Arg Met Gln Gly Asp Tyr Val Leu Gly Gly Leu Phe Pro Leu Gly
        35                  40                  45

Ser Ala Glu Gly Thr Gly Leu Gly Asp Gly Leu Gln Pro Asn Ala Thr
    50                  55                  60

Val Cys Thr Arg Phe Ser Ser Leu Gly Leu Leu Trp Ala Leu Ala Val
65                  70                  75                  80

Lys Met Ala Val Glu Glu Ile Asn Asn Gly Ser Ala Leu Leu Pro Gly
                85                  90                  95



Page 2 



Mon0298 .ST25.txt 
Leu His Leu Gly Tyr Asp Leu Phe Asp Thr Cys Ser Glu Pro Met Val
            100                 105                 110

Ala Met Lys Pro Ser Leu Val Phe Met Ala Lys Ala Gly Ser Cys Ser
        115                 120                 125

Ile Ala Ala Tyr Cys Asn Tyr Thr Gln Tyr Gln Pro Arg Val Leu Ala
    130                 135                 140

Val Ile Gly Pro His Ser Ser Glu Leu Ala Leu Val Thr Gly Lys Phe
145                 150                 155                 160

Phe Ser Phe Phe Leu Val Pro Gln Val Ser Tyr Gly Ala Ser Thr Asp
                165                 170                 175

Arg Leu Ser Asn Arg Glu Ile Phe Pro Ser Phe Phe Arg Thr Val Pro
            180                 185                 190

Ser Asp Gln Val Gln Val Ala Ala Met Val Glu Leu Leu Glu Glu Leu
        195                 200                 205

Gly Trp Asn Trp Val Ala Ala Val Gly Ser Asp Asp Glu Tyr Gly Arg
    210                 215                 220

Gln Gly Leu Ser Leu Phe Ser Gly Leu Ala Ser Ala Arg Gly Ile Cys
225                 230                 235                 240

Ile Ala His Glu Gly Leu Val Pro Leu Pro Pro Gly Ser Leu Arg Leu
                245                 250                 255

Gly Ala Leu Gln Gly Leu Leu Arg Gln Val Asn Gln Ser Ser Val Gln
            260                 265                 270

Val Val Val Leu Phe Ser Ser Ala His Ala Ala Arg Thr Leu Phe Ser
        275                 280                 285

Tyr Ser Ile Arg Cys Lys Leu Ser Pro Lys Val Trp Val Ala Ser Glu
    290                 295                 300

Ala Trp Leu Thr Ser Asp Leu Val Met Thr Leu Pro Gly Met Pro Gly
305                 310                 315                 320

Val Gly Thr Val Leu Gly Phe Leu Gln Gln Gly Ala Pro Met Pro Glu
                325                 330                 335

Phe Pro Ser Tyr Val Arg Thr Arg Leu Ala Leu Ala Ala Asp Pro Ala
            340                 345                 350

Phe Cys Ala Ser Leu Asp Ala Glu Gln Pro Gly Leu Glu Glu His Val
        355                 360                 365

Val Gly Pro Arg Cys Pro Gln Cys Asp His Val Thr Leu Glu Asn Leu
    370                 375                 380

Ser Ala Gly Leu Leu His His Gln Thr Phe Ala Ala Tyr Ala Ala Val
385                 390                 395                 400

Tyr Gly Val Ala Gln Ala Leu His Asn Thr Leu Arg Cys Asn Ala Ser
                405                 410                 415

Gly Cys Pro Arg Arg Glu Pro Val Arg Pro Trp Gln Leu Leu Glu Asn
            420                 425                 430

Met Tyr Asn Val Ser Phe Arg Ala Arg Gly Leu Ala Leu Gln Phe Asp
        435                 440                 445

Ala Ser Gly Asn Val Asn Val Asp Tyr Asp Leu Lys Leu Trp Val Trp
    450                 455                 460

Gln Asp Pro Thr Pro Glu Leu Arg Thr Val Gly Thr Phe Lys Gly Arg
465                 470                 475                 480

Leu Glu Leu Trp Arg Ser Gln Met Cys Trp His Thr Pro Gly Lys Gln
                485                 490                 495

Gln Pro Val Ser Gln Cys Ser Arg Gln Cys Lys Glu Gly Gln Val Arg
            500                 505                 510

Arg Val Lys Gly Phe His Ser Cys Cys Tyr Asn Cys Val Asp Cys Lys
        515                 520                 525

Ala Gly Ser Tyr Gln Arg Asn Pro Asp Asp Leu Leu Cys Thr Gln Cys
    530                 535                 540

Asp Gln Asp Gln Trp Ser Pro Asp Arg Ser Thr Arg Cys Phe Ala Arg
545                 550                 555                 560

Lys Pro Met Phe Leu Ala Trp Gly Glu Pro Ala Val Leu Leu Leu Leu
                565                 570                 575

Ala Leu Leu Ala Leu Ala Leu Gly Leu Ala Leu Ala Ala Leu Gly Leu
            580                 585                 590

Phe Leu Trp His Ser Asp Ser Pro Leu Val Gln Ala Ser Gly Gly Pro
        595                 600                 605

Arg Ala Cys Phe Gly Leu Ala Cys Leu Gly Leu Val Cys Leu Ser Val
    610                 615                 620

Leu Leu Phe Pro Gly Gln Pro Gly Pro Ala Ser Cys Leu Ala Gln Gln
625                 630                 635                 640

Pro Leu Phe His Leu Pro Leu Thr Gly Cys Leu Ser Thr Phe Phe Leu
                645                 650                 655

Gln Ala Ala Glu Ile Phe Val Gly Ser Glu Leu Pro Pro Ser Trp Ala
            660                 665                 670



Glu Lys Met Arg Gly Arg Leu Arg Gly Pro Trp Ala Trp Leu Val Val 
675 680 685 

Leu Leu Ala Met Leu Ala Glu Ala Ala Leu Cys Ala Trp Tyr Leu Val 
690 695 ~ 700 

Ala Phe Pro Pro Glu Val Val Thr Asp Trp Arg Val Leu Pro Thr Glu 
705 710 715 720 

Ala Leu Val His Cys His Val His Ser Trp lie Ser Phe Gly Leu Val 
725 730 735 

His Ala Thr Asn Ala Met Leu Ala Phe Leu Cys Phe Leu Gly Thr Phe 
740 745 ~ 750 

Leu Val Gin Ser Arg Pro Gly Arg Tyr Asn Gly Ala Arg Gly Leu Thr 
755 760 " 765 

Phe Ala Met Leu Ala Tyr Phe lie Thr Trp lie Ser Phe Val Pro Leu 
770 775 J 780 

Phe Ala Asn Val His Val Ala Tyr Gin Pro Ala Val Gin Met Gly Thr 
785 790 795 800 

lie Leu Leu Cys Ala Leu Gly lie Leu Ala Thr Phe His Leu Pro Lys 
805 810 815 

Cys Tyr Leu Leu Leu Gin Arg Pro Glu Leu Asn Thr Pro Glu Phe Phe 
820 825 830 

Leu Glu Asp Asn Ala Arg Ala Gin Gly Ser Ser Trp Gly Gin Gly Arg 
835 840 " 845 

Gly Glu Ser Gly Gin Lys Gin Val Thr Pro Asp Pro Val Thr Ser Pro 
850 855 860 

Gin 
865 

<210> 3 

<211> 2532 

<212> DNA 

<213> Mus musculus 

<400> 3 

atgggacccc aggcgaggac actccatttg ctgtttctcc tgctgcatgc tctgcctaag 60 

ccagtcatgc tggtagggaa ctccgacttt cacctggctg gggactacct cctgggtggc 120 

ctctttaccc tccatgccaa cgtgaagagc gtctctcacc tcagctacct gcaggtgccc 180 

aagtgcaatg agtacaacat gaaggtcttg ggctacaacc tcatgcaggc catgcgattc 240 




gccgtggagg aaatcaacaa ctgtagctct ctgctgcccg gcgtgctgct cggctacgag 300

atggtggatg tctgctacct ctccaacaat atccagcctg ggctctactt cctgtcacag 360

atagatgact tcctgcccat cctcaaagac tacagccagt acaggcccca agtggtggcc 420

gtcattggcc cagacaactc tgagtccgcc atcaccgtgt ccaacattct ctcctacttc 480

ctcgtgccac aggtcacata tagcgccatc accgacaagc tgcgagacaa gcggcgcttc 540

cctgccatgc tgcgcactgt gcccagcgcc acccaccaca tcgaggccat ggtgcaactg 600

atggttcact tccagtggaa ctggatcgtg gtgctggtga gcgatgacga ttatggccga 660

gagaacagcc acctgctgag ccagcgtctg accaacactg gcgatatctg cattgccttc 720

caggaggttc tgcctgtacc agaacccaac caggccgtga ggcctgagga gcaggaccaa 780

ctggacaaca tcctggacaa gctgcggcgg acctcggcgc gtgtggtggt gatattctcg 840

ccagagctga gcctgcacaa cttcttccgc gaggtgctgc gctggaactt cacaggcttt 900

gtgtggattg cctctgagtc ctgggccatc gaccctgttc tacacaacct cacagagctg 960

cgccacacgg gcactttcct gggcgtcacc atccagaggg tgtccatccc tggcttcagc 1020

cagttccgag tgcgccacga caagccagag tatcccatgc ctaacgagac cagcctgagg 1080

actacctgta accaggactg tgacgcctgc atgaacatca ccgagtcctt taacaacgtt 1140

ctcatgcttt cgggggagcg tgtggtctac agtgtgtact cggccgtcta cgcggtagcc 1200

cacaccctcc acagactcct ccactgcaac caggtccgct gcaccaagca aatcgtctat 1260

ccatggcagc tactcaggga gatctggcat gtcaacttca cgctcctggg caaccagctc 1320

ttcttcgacg aacaagggga catgccgatg ctcctggaca tcatccagtg gcaatggggc 1380

ctgagccaga accccttcca aagcatcgcc tcctactccc ccaccgagac gaggctgacc 1440

tacattagca atgtgtcctg gtacaccccc aacaacacgg tccccatatc catgtgttct 1500

aagagttgcc agcctgggca aatgaaaaaa cccataggcc tccacccgtg ctgcttcgag 1560

tgtgtggact gtccgccggg cacctacctc aaccgatcag tagatgagtt taactgtctg 1620

tcctgcccgg gttccatgtg gtcttacaag aacaacatcg cttgcttcaa gcggcggctg 1680

gccttcctgg agtggcacga agtgcccact atcgtggtga ccatcctggc cgccctgggc 1740

ttcatcagta cgctggccat tctgctcatc ttctggagac atttccagac gcccatggtg 1800

cgctcggcgg gcggccccat gtgcttcctg atgctggtgc ccctgctgct ggcgttcggg 1860

atggtccccg tgtatgtggg cccccccacg gtcttctcct gtttctgccg ccaggctttc 1920

ttcaccgttt gcttctccgt ctgcctctcc tgcatcacgg tgcgctcctt ccagattgtg 1980

tgcgtcttca agatggccag acgcctgcca agcgcctacg gtttctggat gcgttaccac 2040

gggccctacg tctttgtggc cttcatcacg gccgtcaagg tggccctggt ggcaggcaac 2100

atgctggcca ccaccatcaa ccccattggc cggaccgacc ccgatgaccc caatatcata 2160

atcctctcct gccaccctaa ctaccgcaac gggctactct tcaacaccag catggacttg 2220

ctgctgtccg tgctgggttt cagcttcgcg tacgtgggca aggaactgcc caccaactac 2280

aacgaagcca agttcatcac cctcagcatg accttctcct tcacctcctc catctccctc 2340

ctggtcacca tcatggatct cctggtcact 2400

gtgctcaact ttctggccat cggcttgggg tactttggcc ccaagtgtta catgatcctt 2460


ttctacccgg 


agcgcaacac 


L LL-Ciy L. LLaL 


ttcaatagca 


tgattcaggg 


ctacacgatg




aggaagagct 


ag 










2532 


<210> 4 

<211> 2529 

<212> DNA 

<213> Rattus rattus 












<400> 4 
atgggtcccc 


aggcaaggac 


3r , 1~r , "t~nr'1~t"rr 

CI I — I - ^ Ly L. I-- L y 


ctgtctctcc 


tgctgcatgt 


L l _ Ly LL Ld.aU 


60


ccaggcaagc 


tggtagagaa 




cacctggccg 


gggactacct 


4— /~T f+ 4- f-f /-x 




ctctttaccc 


tccatgccaa 


<— y uy uay ay u 


atctcccacc 


tcagctacct 




180


aagtgcaatg 


agttcaccat 


y a ay y Ly LLy 


ggctacaacc 


tcatgcaggc


LaLyuy LULL 


240


gctgtggagg 


agatcaacaa 


c\~ ni~ net" rr 


ctgctacccg 


gcgtgctgct 


t_yy<_ idty dy 




atggtggatg 


tctgttacct 


L,^va.a^cio l- 


atccaccctg 


ggctctactt 


l. l, Lyy LdUdy 




gacgacgacc 


tcctgcccat 


^ v- L^u.o a. y d V— ' 


tacagccagt


acatgcccca 


y uyy tyy l l 


420


gtcattggcc 


ccgacaactc 


tgagtccgcc


attaccgtgt 


ccaacattct 




480


ctcatcccac 


agatcacata 


^ a y y LCI l_ V — 


tccgacaagc 


tgcgggacaa 


yL.yyi_civ_i_ l l. 


540


cctagcatgc 


tacgcacagt 


gcccagcgcc 


acccaccaca 


tcgaggccat


y y ty Lay v> Ly 


600


atggttcact 


tccaatggaa 


ctggattgtg


gtgctggtga 


gcgacgacga


l. LaLy y LLy l 


660 


gagaacagcc 


acctgttgag 


ccagcgtctg


accaaaacga 


gcgacatctg 


^ — Cl LLyLL L L L. 


720


caggaggttc 


tgcccatacc 


tgagtccagc


caggtcatga 


ggtccgagga 


y Lay ay glgcl 


780


ctggacaaca 


tcctggacaa 


gctgcggcgg


acctcggcgc 


gcgtcgtggt 


yy Ly l tuLuy 


840


cccgagctga 


gcctgtatag 


rt trM"1" car 


gaggtgctcc


gctggaactt 


i_.aL.yyy ll lu 


900


gtgtggatcg 


cctctgagtc 


l. Ly y y l, lci l l. 


gacccagttc 


tgcataacct


LaLyyayLLy 


960


cgccacacgg 


gtacttttct 




atccagaggg 


tgtccatccc 


tyyuL LLdy l 


1020


cagttccgag 


tgcgccgtga 


^ clcl y w v ~ - d y y y 


tatcccgtgc 


ctaacacgac 


LdaLL Ly uyy 


1080


acgacctgca 


accaggactg 


tgacgcctgc 


ttgaacacca 


ccaagtcctt 


L«aciL.aaLci lv_- 


1140 


cttatacttt 


cgggggagcg 




agcgtgtact 


cggcagttta


uyuyy Ly y ll 




catgccctcc 


acagactcct 


cggctgtaac


cgggtccgct 


gcaccaagca 


a aay y ll LaL 


1260


ccgtggcagc 


tactcaggga 


yuL^Lyy^a^ 


gtcaacttca 


cgctcctggg 


LaaLLyy LLL 


1320


ttctttgacc 


aacaagggga 


ratrirpn'S'tTT 
LatyLLyaiy 


ctcttggaca 


tcatccagtg 


y Lay _yyydL 


1380


ctgagccaga atcccttcca aagcatcgcc tcctattctc ccaccagcaa gaggctaacc 1440

tacattaaca atgtgtcctg gtacaccccc aacaacacgg tccctgtctc catgtgttcc 1500

aagagctgcc agccagggca aatgaaaaag tctgtgggcc tccacccttg ttgcttcgag 1560

tgcttggatt gtatgccagg cacctacctc aaccgctcag cagatgagtt taactgtctg 1620


tcctgcccgg 


gttccatgtg 


gtcctacaag 


aacgacatca cttgcttcca 
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1680 
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accttcctgg 


agtggcacga 


agtgcccacc 


atcgtggtgg 


cca tact croc 




1740 


ttcttcagta cactggccat tcttttcatc ttctggagac atttccagac acccatggtg 1800


cgctcggccg


gtggccccat 


gtgcttcctg 


atgctcgtgc


ccctgctgct


yy *~y u *- <-yy y 


1860 


atggtgcccg tgtatgtggg gccccccacg gtcttctcat gcttctgccg acaggctttc 1920


ttcaccgtct


gcttctccat


ctgcctatcc 


tgcatcaccg


tgcgctcctt


rpanat" rrrtrr 

v>_uv^ai Luy i_ y 


1980 


tgtgtcttca


agatggccag


acgcctgcca 


a rr1~ etc ft arfl 
y .I — — 


nttt 1-trrrrat- 


rrr* ret - 1* a ppap 
y 1 — y — i_ a i_ ol 


2040 


gggccctatg 


tcttcgtggc 


cttcatcacg 


gccatcaagg


tggccctggt


yy i^yyyi_*aa<w 


2100 


atgctggcca


ccaccatcaa 


ccccattggc




rnciFi 1~ era ccc 


r , aar , ;=jt*r'at~rT 


2160 


atcctctcgt 


gccaccctaa 


ctaccgcaac 


gggctactgt 


tcaacaccag 


cat" crcra c 1 1" cr 


2220 


ctgctgtctg tgctgggttt cagcttcgct tacatgggca aggagctgcc caccaactac 2280

aacgaagcca agttcatcac tctcagcatg accttctcct tcacctcctc catctccctc 2340

tgcaccttca tgtctgtgca cgacggcgtg ctggtcacca tcatggacct cctggtcact 2400


gtgctcaact 


tcctggccat 


cyyo l uy y y a 


4— ^\ /--^ 4- -H 4- >-if n /~* 


ccaagtgtta


catgatcctt


2460


ttctacccgg agcgcaacac ctcagcctat ttcaatagca tgatccaggg ctacaccatg 2520

aggaagagc 2529


<210> 5 

<211> 2520 

<212> DNA 

<213> Homo sapiens 












<400> 5 
atggggccca 


gggcaaagac 


catctgctcc 


ctgttcttcc 


1" c c 1~ a t* n nrr t* 


i_-v_. iyy L- uy ay 


60 


ccggctgaga 


actcggactt


ctacctgcct 


aaaaat' farr 

yyy y c*. ^ — ci \~- 




' — L, ^ U l_ \— . L- 


120 


ctccatgcca 


acatgaaggg 


cattgttcac


cttaacttcc 


L-y v u.^u *_ y ^— - > — ■ 


cl i_ y l. y v_ aay 


180 


gagtatgaag 


tgaaggtgat 


aggctacaac 


ctcatgcagg


ccatgcgctt 


CCTPCTCT1" ncfFi n 

? y y ^ y y a. y 


240 


gagatcaaca 


atgacagcag 


cctgctgcct 


q o t a t cr c t ac 


't anact a 1" rra 
y yy a <_ y a 


y a i_ v_ y i_ y y a l. 


300 


gtgtgctaca 


tctccaacaa 


tgtccagccg 


gtgctctact


tcctggcaca 


y ci y y a l_ a a 


360


ctccttccca 


tccaagagga 


ctacagtaac


tacatttccc 


rrt" fit" rrrrt" rrrrr 1 
y i_y uy y L.y y 


«-y l _a l ty y l 


420 


cctgacaact 


ccgagtctgt 


pat'a^r"t~cr't~(~r 

—I y Ci l_. y ,_. y 






f r~" 1~ f ~ irrs 


480


cagatcacct 


acagcgccat 


cacrrcrat aacr 

■ a. M ^ y d i— y a y 


v_. y y a y u.u 


ay y uy ^y l l. l 


r , r*r , rT( r Tf~'+""t"1~rr 


540 


ctgcgtacca 


cacccagcgc 




y LL.Ljai|jy uua 


uyy i_ y a y l» t_ 


ya Lyt,tyL.dL 




ttccgctgga actggatcat tgtgctggtg agcagcgaca cctatggccg cgacaatggc 660

cagctgcttg gcgagcgcgt ggcccggcgc gacatctgca tcgccttcca ggagacgctg 720

cccacactgc agcccaacca gaacatgacg tcagaggagc gccagcgcct ggtgaccatt 780

gtggacaagc tgcagcagag cacagcgcgc gtcgtggtcg tgttctcgcc cgacctgacc 840

ctgtaccact tcttcaatga ggtgctgcgc cagaacttca cgggcgccgt gtggatcgcc 900

tccgagtcct gggccatcga cccggtcctg cacaacctca cggagctggg ccacttgggc 960

accttcctgg gcatcaccat ccagagcgtg cccatcccgg gcttcagtga gttccgcgag 1020

tggggcccac aggctgggcc gccacccctc agcaggacca gccagagcta tacctgcaac 1080

caggagtgcg acaactgcct gaacgccacc ttgtccttca acaccattct caggctctct 1140

ggggagcgtg tcgtctacag cgtgtactct gcggtctatg ctgtggccca tgccctgcac 1200

agcctcctcg gctgtgacaa aagcacctgc accaagaggg tggtctaccc ctggcagctg 1260

cttgaggaga tctggaaggt caacttcact ctcctggacc accaaatctt cttcgacccg 1320

caaggggacg tggctctgca cttggagatt gtccagtggc aatgggaccg gagccagaat 1380

cccttccaga gcgtcgcctc ctactacccc ctgcagcgac agctgaagaa catccaagac 1440

atctcctggc acaccgtcaa caacacgatc cctatgtcca tgtgttccaa gaggtgccag 1500

tcagggcaaa agaagaagcc tgtgggcatc cacgtctgct gcttcgagtg catcgactgc 1560

cttcccggca ccttcctcaa ccacactgaa gatgaatatg aatgccaggc ctgcccgaat 1620

aacgagtggt cctaccagag tgagacctcc tgcttcaagc ggcagctggt cttcctggaa 1680

tggcatgagg cacccaccat cgctgtggcc ctgctggccg ccctgggctt cctcagcacc 1740

ctggccatcc tggtgatatt ctggaggcac ttccagacac ccatagttcg ctcggctggg 1800


ggccccatgt 


gcttcctgat 
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<213> Homo sapiens 












<400> 11 
atgctgggcc 


ctgctgtcct 


pt pt pi p* p* t~ p* a ptp* 


l l l iyyy lll. 


LLLLyLQLLL 


tgggacgggg 


60


gccccattgt 


gcctgtcaca 


rfrsflrt" 1~ .3 <^PT 
y l d d l l Lay y 


d Lyaay yyyy 


dLLdLy Lyui 


rtn rr /*y rx/^r rr "f~ 

yyyggggcLg 


120


ttccccctgg 


gcgaggccga 


y y ciy y 0 Ly y l 


l Luty taytL 


yy dLdLy ycL 


Cdy Cay lCCl 


180


gtgtgcacca 


ggttctcctc 


ci 0. a L-y y w l* y 


* — ll, Ly y y lul 


Lyy LLaLyaQ 


aa l y y ll y Ly 


240


gaggagatca 


acaacaagtc 


frrr;R"tT , 1~rrf > 1~rT 
yyaL^uy^uy 


rrrnrrrrrfrrp 
LLLyyyt Ly u 


pr/" 1 t" ptpt pr p" +* ^ 
yLL LyyyLLa 


LydLLLL LLL 




gatacgtgct 


cggagcctgt 


cra"l~ncrr x r'^1 _ n 
y j ^-y y ^ ^— ' , — y 


day LLLay ll 


LLd Ly LLLLL 


yy LLddyyLa 




ggcagccgcg 


acatcgccgc 




LaLaLy Lay L 


aLLdy LLLLy 


Ly LyLLyyLL 


420


gtcatcgggc 


cccactcgtc 


ay ayL. Lv^y 


a l y y LLaLLy 


rrra art!" fpl" f 
y La. ay llLll 


LdyLLLLLLL 




ctcatgcccc 


aggtcagcta 


l y y Ly l Lay l 


aLyyay L uyi, 


4~ or -5 or o» or 0* o» py 

LydyLyLLLy 


ggagaccttc 


540


ccctccttct 


tccgcaccgt 


y l LLay l yat 


Ly Ly Ly Lay l 


LyaLyyLLyu 


Ly Lggdy l Ly 


600


ctgcaggagt 


tcggctggaa 


r~ , t~<TPTPTl - PfPTP' , P' 

l Ly y y iyy ll 


y lll Lyyyua 


yLyaLydLy a 


gtacggccgg


660


cagggcctga 


gcatcttctc 


y y 1 — 1 — " — Lyy ll 


PT P" Pf PTP" .P P* PTP" PT 

y Ly y LaLycy 


y L d LLLyLdL 


Ly Ly Ldcgag 


"7 9 n 


ggcctggtgc 


cgctgccccg 


Ly LLya l y d L 


LLy Ly y l Lyy 


it n a a t~xr~r "t - rrr*a 

y y day y uy La. 


ggacgtcctg


780


caccaggtga accagagcag cgtgcaggtg gtgctgctgt tcgcctccgt gcacgccgcc 840

cacgccctct tcaactacag catcagcagc aggctctcgc ccaaggtgtg ggtggccagc 900

gaggcctggc tgacctctga cctggtcatg gggctgcccg gcatggccca gatgggcacg 960

gtgcttggct tcctccagag gggtgcccag ctgcacgagt tcccccagta cgtgaagacg 1020

cacctggccc tggccaccga cccggccttc tgctctgccc tgggcgagag ggagcagggt 1080

ctggaggagg acgtggtggg ccagcgctgc ccgcagtgtg actgcatcac gctgcagaac 1140

gtgagcgcag ggctaaatca ccaccagacg ttctctgtct acgcagctgt gtatagcgtg 1200

gcccaggccc tgcacaacac tcttcagtgc aacgcctcag gctgccccgc gcaggacccc 1260

gtgaagccct ggcagctcct ggagaacatg tacaacctga ccttccacgt gggcgggctg 1320

ccgctgcggt tcgacagcag cggaaacgtg gacatggagt acgacctgaa gctgtgggtg 1380

tggcagggct cagtgcccag gctccacgac gtgggcaggt tcaacggcag cctcaggaca 1440

gagcgcctga agatccgctg gcacacgtct gacaaccaga agcccgtgtc ccggtgctcg 1500

cggcagtgcc aggagggcca ggtgcgccgg gtcaaggggt tccactcctg ctgctacgac 1560

tgtgtggact gcgaggcggg cagctaccgg caaaacccag acgacatcgc ctgcaccttt 1620

tgtggccagg atgagtggtc cccggagcga agcacacgct gcttccgccg caggtctcgg 1680

ttcctggcat ggggcgagcc ggctgtgctg ctgctgctcc tgctgctgag cctggcgctg 1740

ggccttgtgc tggctgcttt ggggctgttc gttcaccatc gggacagccc actggttcag 1800

gcctcggggg ggcccctggc ctgctttggc ctggtgtgcc tgggcctggt ctgcctcagc 1860

gtcctcctgt tccctggcca gcccagccct gcccgatgcc tggcccagca gcccttgtcc 1920

cacctcccgc tcacgggctg cctgagcaca ctcttcctgc aggcggccga gatcttcgtg 1980

gagtcagaac tgcctctgag ctgggcagac cggctgagtg gctgcctgcg ggggccctgg 2040

gcctggctgg tggtgctgct ggccatgctg gtggaggtcg cactgtgcac ctggtacctg 2100

gtggccttcc cgccggaggt ggtgacggac tggcacatgc tgcccacgga ggcgctggtg 2160

cactgccgca cacgctcctg ggtcagcttc ggcctagcgc acgccaccaa tgccacgctg 2220

gcctttctct gcttcctggg cactttcctg gtgcggagcc agccgggccg ctacaaccgt 2280

gcccgtggcc tcacctttgc catgctggcc tacttcatca cctgggtctc ctttgtgccc 2340

ctcctggcca atgtgcaggt ggtcctcagg cccgccgtgc agatgggcgc cctcctgctc 2400

tgtgtcctgg gcatcctggc tgccttccac ctgcccaggt gttacctgct catgcggcag 2460

ccagggctca acacccccga gttcttcctg ggagggggcc ctggggatgc ccaaggccag 2520

aatgacggga acacaggaaa tcaggggaaa catgagtga 2559



<210> 12 

<211> 852 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Leu Gly Pro Ala Val Leu Gly Leu Ser Leu Trp Ala Leu Leu His
1 5 10 15

Pro Gly Thr Gly Ala Pro Leu Cys Leu Ser Gln Gln Leu Arg Met Lys
20 25 30

Gly Asp Tyr Val Leu Gly Gly Leu Phe Pro Leu Gly Glu Ala Glu Glu
35 40 45

Ala Gly Leu Arg Ser Arg Thr Arg Pro Ser Ser Pro Val Cys Thr Arg
50 55 60

Phe Ser Ser Asn Gly Leu Leu Trp Ala Leu Ala Met Lys Met Ala Val
65 70 75 80

Glu Glu Ile Asn Asn Lys Ser Asp Leu Leu Pro Gly Leu Arg Leu Gly
85 90 95

Tyr Asp Leu Phe Asp Thr Cys Ser Glu Pro Val Val Ala Met Lys Pro
100 105 110

Ser Leu Met Phe Leu Ala Lys Ala Gly Ser Arg Asp Ile Ala Ala Tyr
115 120 125

Cys Asn Tyr Thr Gln Tyr Gln Pro Arg Val Leu Ala Val Ile Gly Pro
130 135 140

His Ser Ser Glu Leu Ala Met Val Thr Gly Lys Phe Phe Ser Phe Phe
145 150 155 160

Leu Met Pro Gln Val Ser Tyr Gly Ala Ser Met Glu Leu Leu Ser Ala
165 170 175

Arg Glu Thr Phe Pro Ser Phe Phe Arg Thr Val Pro Ser Asp Arg Val
180 185 190

Gln Leu Thr Ala Ala Ala Glu Leu Leu Gln Glu Phe Gly Trp Asn Trp
195 200 205

Val Ala Ala Leu Gly Ser Asp Asp Glu Tyr Gly Arg Gln Gly Leu Ser
210 215 220

Ile Phe Ser Ala Leu Ala Ala Ala Arg Gly Ile Cys Ile Ala His Glu
225 230 235 240

Gly Leu Val Pro Leu Pro Arg Ala Asp Asp Ser Arg Leu Gly Lys Val
245 250 255

Gln Asp Val Leu His Gln Val Asn Gln Ser Ser Val Gln Val Val Leu
260 265 270

Leu Phe Ala Ser Val His Ala Ala His Ala Leu Phe Asn Tyr Ser Ile
275 280 285

Ser Ser Arg Leu Ser Pro Lys Val Trp Val Ala Ser Glu Ala Trp Leu
290 295 300

Thr Ser Asp Leu Val Met Gly Leu Pro Gly Met Ala Gln Met Gly Thr
305 310 315 320

Val Leu Gly Phe Leu Gln Arg Gly Ala Gln Leu His Glu Phe Pro Gln
325 330 335

Tyr Val Lys Thr His Leu Ala Leu Ala Thr Asp Pro Ala Phe Cys Ser
340 345 350

Ala Leu Gly Glu Arg Glu Gln Gly Leu Glu Glu Asp Val Val Gly Gln
355 360 365

Arg Cys Pro Gln Cys Asp Cys Ile Thr Leu Gln Asn Val Ser Ala Gly
370 375 380

Leu Asn His His Gln Thr Phe Ser Val Tyr Ala Ala Val Tyr Ser Val
385 390 395 400

Ala Gln Ala Leu His Asn Thr Leu Gln Cys Asn Ala Ser Gly Cys Pro
405 410 415

Ala Gln Asp Pro Val Lys Pro Trp Gln Leu Leu Glu Asn Met Tyr Asn
420 425 430

Leu Thr Phe His Val Gly Gly Leu Pro Leu Arg Phe Asp Ser Ser Gly
435 440 445

Asn Val Asp Met Glu Tyr Asp Leu Lys Leu Trp Val Trp Gln Gly Ser
450 455 460

Val Pro Arg Leu His Asp Val Gly Arg Phe Asn Gly Ser Leu Arg Thr
465 470 475 480

Glu Arg Leu Lys Ile Arg Trp His Thr Ser Asp Asn Gln Lys Pro Val
485 490 495

Ser Arg Cys Ser Arg Gln Cys Gln Glu Gly Gln Val Arg Arg Val Lys
500 505 510

Gly Phe His Ser Cys Cys Tyr Asp Cys Val Asp Cys Glu Ala Gly Ser
515 520 525

Tyr Arg Gln Asn Pro Asp Asp Ile Ala Cys Thr Phe Cys Gly Gln Asp
530 535 540

Glu Trp Ser Pro Glu Arg Ser Thr Arg Cys Phe Arg Arg Arg Ser Arg
545 550 555 560

Phe Leu Ala Trp Gly Glu Pro Ala Val Leu Leu Leu Leu Leu Leu Leu
565 570 575

Ser Leu Ala Leu Gly Leu Val Leu Ala Ala Leu Gly Leu Phe Val His
580 585 590

His Arg Asp Ser Pro Leu Val Gln Ala Ser Gly Gly Pro Leu Ala Cys
595 600 605

Phe Gly Leu Val Cys Leu Gly Leu Val Cys Leu Ser Val Leu Leu Phe
610 615 620

Pro Gly Gln Pro Ser Pro Ala Arg Cys Leu Ala Gln Gln Pro Leu Ser
625 630 635 640

His Leu Pro Leu Thr Gly Cys Leu Ser Thr Leu Phe Leu Gln Ala Ala
645 650 655

Glu Ile Phe Val Glu Ser Glu Leu Pro Leu Ser Trp Ala Asp Arg Leu
660 665 670

Ser Gly Cys Leu Arg Gly Pro Trp Ala Trp Leu Val Val Leu Leu Ala
675 680 685

Met Leu Val Glu Val Ala Leu Cys Thr Trp Tyr Leu Val Ala Phe Pro
690 695 700

Pro Glu Val Val Thr Asp Trp His Met Leu Pro Thr Glu Ala Leu Val
705 710 715 720

His Cys Arg Thr Arg Ser Trp Val Ser Phe Gly Leu Ala His Ala Thr
725 730 735

Asn Ala Thr Leu Ala Phe Leu Cys Phe Leu Gly Thr Phe Leu Val Arg
740 745 750

Ser Gln Pro Gly Arg Tyr Asn Arg Ala Arg Gly Leu Thr Phe Ala Met
755 760 765

Leu Ala Tyr Phe Ile Thr Trp Val Ser Phe Val Pro Leu Leu Ala Asn
770 775 780

Val Gln Val Val Leu Arg Pro Ala Val Gln Met Gly Ala Leu Leu Leu
785 790 795 800

Cys Val Leu Gly Ile Leu Ala Ala Phe His Leu Pro Arg Cys Tyr Leu
805 810 815

Leu Met Arg Gln Pro Gly Leu Asn Thr Pro Glu Phe Phe Leu Gly Gly
820 825 830

Gly Pro Gly Asp Ala Gln Gly Gln Asn Asp Gly Asn Thr Gly Asn Gln
835 840 845

Gly Lys His Glu
850



<210> 13 

<211> 858 

<212> PRT 

<213> Mus musculus 

<400> 13 



Met Pro Ala Leu Ala Ile Met Gly Leu Ser Leu Ala Ala Phe Leu Glu
1 5 10 15

Leu Gly Met Gly Ala Ser Leu Cys Leu Ser Gln Gln Phe Lys Ala Gln
20 25 30

Gly Asp Tyr Ile Leu Gly Gly Leu Phe Pro Leu Gly Ser Thr Glu Glu
35 40 45

Ala Thr Leu Asn Gln Arg Thr Gln Pro Asn Ser Ile Pro Cys Asn Arg
50 55 60

Phe Ser Pro Leu Gly Leu Phe Leu Ala Met Ala Met Lys Met Ala Val
65 70 75 80

Glu Glu Ile Asn Asn Gly Ser Ala Leu Leu Pro Gly Leu Arg Leu Gly
85 90 95

Tyr Asp Leu Phe Asp Thr Cys Ser Glu Pro Val Val Thr Met Lys Ser
100 105 110

Ser Leu Met Phe Leu Ala Lys Val Gly Ser Gln Ser Ile Ala Ala Tyr
115 120 125

Cys Asn Tyr Thr Gln Tyr Gln Pro Arg Val Leu Ala Val Ile Gly Pro
130 135 140

His Ser Ser Glu Leu Ala Leu Ile Thr Gly Lys Phe Phe Ser Phe Phe
145 150 155 160

Leu Met Pro Gln Val Ser Tyr Ser Ala Ser Met Asp Arg Leu Ser Asp
165 170 175

Arg Glu Thr Phe Pro Ser Phe Phe Arg Thr Val Pro Ser Asp Arg Val
180 185 190

Gln Leu Gln Ala Val Val Thr Leu Leu Gln Asn Phe Ser Trp Asn Trp
195 200 205

Val Ala Ala Leu Gly Ser Asp Asp Asp Tyr Gly Arg Glu Gly Leu Ser
210 215 220

Ile Phe Ser Ser Leu Ala Asn Ala Arg Gly Ile Cys Ile Ala His Glu
225 230 235 240

Gly Leu Val Pro Gln His Asp Thr Ser Gly Gln Gln Leu Gly Lys Val
245 250 255

Leu Asp Val Leu Arg Gln Val Asn Gln Ser Lys Val Gln Val Val Val
260 265 270

Leu Phe Ala Ser Ala Arg Ala Val Tyr Ser Leu Phe Ser Tyr Ser Ile
275 280 285

His His Gly Leu Ser Pro Lys Val Trp Val Ala Ser Glu Ser Trp Leu
290 295 300

Thr Ser Asp Leu Val Met Thr Leu Pro Asn Ile Ala Arg Val Gly Thr
305 310 315 320

Val Leu Gly Phe Leu Gln Arg Gly Ala Leu Leu Pro Glu Phe Ser His
325 330 335

Tyr Val Glu Thr His Leu Ala Leu Ala Ala Asp Pro Ala Phe Cys Ala
340 345 350

Ser Leu Asn Ala Glu Leu Asp Leu Glu Glu His Val Met Gly Gln Arg
355 360 365

Cys Pro Arg Cys Asp Asp Ile Met Leu Gln Asn Leu Ser Ser Gly Leu
370 375 380

Leu Gln Asn Leu Ser Ala Gly Gln Leu His His Gln Ile Phe Ala Thr
385 390 395 400

Tyr Ala Ala Val Tyr Ser Val Ala Gln Ala Leu His Asn Thr Leu Gln
405 410 415

Cys Asn Val Ser His Cys His Val Ser Glu His Val Leu Pro Trp Gln
420 425 430

Leu Leu Glu Asn Met Tyr Asn Met Ser Phe His Ala Arg Asp Leu Thr
435 440 445

Leu Gln Phe Asp Ala Glu Gly Asn Val Asp Met Glu Tyr Asp Leu Lys
450 455 460

Met Trp Val Trp Gln Ser Pro Thr Pro Val Leu His Thr Val Gly Thr
465 470 475 480

Phe Asn Gly Thr Leu Gln Leu Gln Gln Ser Lys Met Tyr Trp Pro Gly
485 490 495

Asn Gln Val Pro Val Ser Gln Cys Ser Arg Gln Cys Lys Asp Gly Gln
500 505 510

Val Arg Arg Val Lys Gly Phe His Ser Cys Cys Tyr Asp Cys Val Asp
515 520 525

Cys Lys Ala Gly Ser Tyr Arg Lys His Pro Asp Asp Phe Thr Cys Thr
530 535 540

Pro Cys Asn Gln Asp Gln Trp Ser Pro Glu Lys Ser Thr Ala Cys Leu
545 550 555 560

Pro Arg Arg Pro Lys Phe Leu Ala Trp Gly Glu Pro Val Val Leu Ser
565 570 575

Leu Leu Leu Leu Leu Cys Leu Val Leu Gly Leu Ala Leu Ala Ala Leu
580 585 590

Gly Leu Ser Val His His Trp Asp Ser Pro Leu Val Gln Ala Ser Gly
595 600 605

Gly Ser Gln Phe Cys Phe Gly Leu Ile Cys Leu Gly Leu Phe Cys Leu
610 615 620

Ser Val Leu Leu Phe Pro Gly Arg Pro Ser Ser Ala Ser Cys Leu Ala
625 630 635 640

Gln Gln Pro Met Ala His Leu Pro Leu Thr Gly Cys Leu Ser Thr Leu
645 650 655

Phe Leu Gln Ala Ala Glu Thr Phe Val Glu Ser Glu Leu Pro Leu Ser
660 665 670

Trp Ala Asn Trp Leu Cys Ser Tyr Leu Arg Gly Leu Trp Ala Trp Leu
675 680 685

Val Val Leu Leu Ala Thr Phe Val Glu Ala Ala Leu Cys Ala Trp Tyr
690 695 700

Leu Ile Ala Phe Pro Pro Glu Val Val Thr Asp Trp Ser Val Leu Pro
705 710 715 720

Thr Glu Val Leu Glu His Cys His Val Arg Ser Trp Val Ser Leu Gly
725 730 735

Leu Val His Ile Thr Asn Ala Met Leu Ala Phe Leu Cys Phe Leu Gly
740 745 750

Thr Phe Leu Val Gln Ser Gln Pro Gly Arg Tyr Asn Arg Ala Arg Gly
755 760 765

Leu Thr Phe Ala Met Leu Ala Tyr Phe Ile Thr Trp Val Ser Phe Val
770 775 780

Pro Leu Leu Ala Asn Val Gln Val Ala Tyr Gln Pro Ala Val Gln Met
785 790 795 800

Gly Ala Ile Leu Val Cys Ala Leu Gly Ile Leu Val Thr Phe His Leu
805 810 815

Pro Lys Cys Tyr Val Leu Leu Trp Leu Pro Lys Leu Asn Thr Gln Glu
820 825 830

Phe Phe Leu Gly Arg Asn Ala Lys Lys Ala Ala Asp Glu Asn Ser Gly
835 840 845

Gly Gly Glu Ala Ala Gln Gly His Asn Glu
850 855



<210> 14 
<211> 858 
<212> PRT 

<213> Rattus rattus 
<400> 14 

Met Pro Gly Leu Ala Ile Leu Gly Leu Ser Leu Ala Ala Phe Leu Glu
1 5 10 15

Leu Gly Met Gly Ser Ser Leu Cys Leu Ser Gln Gln Phe Lys Ala Gln
20 25 30

Gly Asp Tyr Ile Leu Gly Gly Leu Phe Pro Leu Gly Thr Thr Glu Glu
35 40 45

Ala Thr Leu Asn Gln Arg Thr Gln Pro Asn Gly Ile Leu Cys Thr Arg
50 55 60

Phe Ser Pro Leu Gly Leu Phe Leu Ala Met Ala Met Lys Met Ala Val
65 70 75 80

Glu Glu Ile Asn Asn Gly Ser Ala Leu Leu Pro Gly Leu Arg Leu Gly
85 90 95

Tyr Asp Leu Phe Asp Thr Cys Ser Glu Pro Val Val Thr Met Lys Pro
100 105 110

Ser Leu Met Phe Met Ala Lys Val Gly Ser Gln Ser Ile Ala Ala Tyr
115 120 125

Cys Asn Tyr Thr Gln Tyr Gln Pro Arg Val Leu Ala Val Ile Gly Pro
130 135 140

His Ser Ser Glu Leu Ala Leu Ile Thr Gly Lys Phe Phe Ser Phe Phe
145 150 155 160

Leu Met Pro Gln Val Ser Tyr Ser Ala Ser Met Asp Arg Leu Ser Asp
165 170 175

Arg Glu Thr Phe Pro Ser Phe Phe Arg Thr Val Pro Ser Asp Arg Val
180 185 190

Gln Leu Gln Ala Val Val Thr Leu Leu Gln Asn Phe Ser Trp Asn Trp
195 200 205

Val Ala Ala Leu Gly Ser Asp Asp Asp Tyr Gly Arg Glu Gly Leu Ser
210 215 220

Ile Phe Ser Gly Leu Ala Asn Ser Arg Gly Ile Cys Ile Ala His Glu
225 230 235 240

Gly Leu Val Pro Gln His Asp Thr Ser Gly Gln Gln Leu Gly Lys Val
245 250 255

Val Asp Val Leu Arg Gln Val Asn Gln Ser Lys Val Gln Val Val Val
260 265 270

Leu Phe Ala Ser Ala Arg Ala Val Tyr Ser Leu Phe Ser Tyr Ser Ile
275 280 285

Leu His Asp Leu Ser Pro Lys Val Trp Val Ala Ser Glu Ser Trp Leu
290 295 300

Thr Ser Asp Leu Val Met Thr Leu Pro Asn Ile Ala Arg Val Gly Thr
305 310 315 320

Val Leu Gly Phe Leu Gln Arg Gly Ala Leu Leu Pro Glu Phe Ser His
325 330 335

Tyr Val Glu Thr Arg Leu Ala Leu Ala Ala Asp Pro Thr Phe Cys Ala
340 345 350

Ser Leu Lys Ala Glu Leu Asp Leu Glu Glu Arg Val Met Gly Pro Arg
355 360 365

Cys Ser Gln Cys Asp Tyr Ile Met Leu Gln Asn Leu Ser Ser Gly Leu
370 375 380

Met Gln Asn Leu Ser Ala Gly Gln Leu His His Gln Ile Phe Ala Thr
385 390 395 400

Tyr Ala Ala Val Tyr Ser Val Ala Gln Ala Leu His Asn Thr Leu Gln
405 410 415

Cys Asn Val Ser His Cys His Thr Ser Glu Pro Val Gln Pro Trp Gln
420 425 430

Leu Leu Glu Asn Met Tyr Asn Met Ser Phe Arg Ala Arg Asp Leu Thr
435 440 445

Leu Gln Phe Asp Ala Lys Gly Ser Val Asp Met Glu Tyr Asp Leu Lys
450 455 460

Met Trp Val Trp Gln Ser Pro Thr Pro Val Leu His Thr Val Gly Thr
465 470 475 480

Phe Asn Gly Thr Leu Gln Leu Gln His Ser Lys Met Tyr Trp Pro Gly
485 490 495

Asn Gln Val Pro Val Ser Gln Cys Ser Arg Gln Cys Lys Asp Gly Gln
500 505 510

Val Arg Arg Val Lys Gly Phe His Ser Cys Cys Tyr Asp Cys Val Asp
515 520 525

Cys Lys Ala Gly Ser Tyr Arg Lys His Pro Asp Asp Phe Thr Cys Thr
530 535 540

Pro Cys Gly Lys Asp Gln Trp Ser Pro Glu Lys Ser Thr Thr Cys Leu
545 550 555 560

Pro Arg Arg Pro Lys Phe Leu Ala Trp Gly Glu Pro Ala Val Leu Ser
565 570 575

Leu Leu Leu Leu Leu Cys Leu Val Leu Gly Leu Thr Leu Ala Ala Leu
580 585 590

Gly Leu Phe Val His Tyr Trp Asp Ser Pro Leu Val Gln Ala Ser Gly
595 600 605

Gly Ser Leu Phe Cys Phe Gly Leu Ile Cys Leu Gly Leu Phe Cys Leu
610 615 620

Ser Val Leu Leu Phe Pro Gly Arg Pro Arg Ser Ala Ser Cys Leu Ala
625 630 635 640

Gln Gln Pro Met Ala His Leu Pro Leu Thr Gly Cys Leu Ser Thr Leu
645 650 655

Phe Leu Gln Ala Ala Glu Ile Phe Val Glu Ser Glu Leu Pro Leu Ser
660 665 670

Trp Ala Asn Trp Leu Cys Ser Tyr Leu Arg Gly Pro Trp Ala Trp Leu
675 680 685

Val Val Leu Leu Ala Thr Leu Val Glu Ala Ala Leu Cys Ala Trp Tyr
690 695 700

Leu Met Ala Phe Pro Pro Glu Val Val Thr Asp Trp Gln Val Leu Pro
705 710 715 720

Thr Glu Val Leu Glu His Cys Arg Met Arg Ser Trp Val Ser Leu Gly
725 730 735

Leu Val His Ile Thr Asn Ala Val Leu Ala Phe Leu Cys Phe Leu Gly
740 745 750

Thr Phe Leu Val Gln Ser Gln Pro Gly Arg Tyr Asn Arg Ala Arg Gly
755 760 765

Leu Thr Phe Ala Met Leu Ala Tyr Phe Ile Ile Trp Val Ser Phe Val
770 775 780

Pro Leu Leu Ala Asn Val Gln Val Ala Tyr Gln Pro Ala Val Gln Met
785 790 795 800

Gly Ala Ile Leu Phe Cys Ala Leu Gly Ile Leu Ala Thr Phe His Leu
805 810 815

Pro Lys Cys Tyr Val Leu Leu Trp Leu Pro Glu Leu Asn Thr Gln Glu
820 825 830

Phe Phe Leu Gly Arg Ser Pro Lys Glu Ala Ser Asp Gly Asn Ser Gly
835 840 845

Ser Ser Glu Ala Thr Arg Gly His Ser Glu
850 855



<210> 15 

<211> 842 

<212> PRT 

<213> Mus musculus 

<400> 15 

Met Leu Phe Trp Ala Ala His Leu Leu Leu Ser Leu Gln Leu Ala Val
1 5 10 15



Ala Tyr Cys Trp Ala Phe Ser Cys Gln Arg Thr Glu Ser Ser Pro Gly
20 25 30



Phe Ser Leu Pro Gly Asp Phe Leu Leu Ala Gly Leu Phe Ser Leu His
35 40 45



Ala Asp Cys Leu Gln Val Arg His Arg Pro Leu Val Thr Ser Cys Asp
50 55 60



Arg Ser Asp Ser Phe Asn Gly His Gly Tyr His Leu Phe Gln Ala Met
65 70 75 80



Arg Phe Thr Val Glu Glu Ile Asn Asn Ser Thr Ala Leu Leu Pro Asn
85 90 95



Ile Thr Leu Gly Tyr Glu Leu Tyr Asp Val Cys Ser Glu Ser Ser Asn
100 105 110



Val Tyr Ala Thr Leu Arg Val Leu Ala Gln Gln Gly Thr Gly His Leu
115 120 125



Glu Met Gln Arg Asp Leu Arg Asn His Ser Ser Lys Val Val Ala Leu
130 135 140



Ile Gly Pro Asp Asn Thr Asp His Ala Val Thr Thr Ala Ala Leu Leu
145 150 155 160



Ser Pro Phe Leu Met Pro Leu Val Ser Tyr Glu Ala Ser Ser Val Ile
165 170 175



Leu Ser Gly Lys Arg Lys Phe Pro Ser Phe Leu Arg Thr Ile Pro Ser
180 185 190



Asp Lys Tyr Gln Val Glu Val Ile Val Arg Leu Leu Gln Ser Phe Gly
195 200 205



Trp Val Trp Ile Ser Leu Val Gly Ser Tyr Gly Asp Tyr Gly Gln Leu
210 215 220



Gly Val Gln Ala Leu Glu Glu Leu Ala Thr Pro Arg Gly Ile Cys Val
225 230 235 240



Ala Phe Lys Asp Val Val Pro Leu Ser Ala Gln Ala Gly Asp Pro Arg
245 250 255



Met Gln Arg Met Met Leu Arg Leu Ala Arg Ala Arg Thr Thr Val Val
260 265 270



Val Val Phe Ser Asn Arg His Leu Ala Gly Val Phe Phe Arg Ser Val
275 280 285



Val Leu Ala Asn Leu Thr Gly Lys Val Trp Ile Ala Ser Glu Asp Trp
290 295 300



Ala Ile Ser Thr Tyr Ile Thr Asn Val Pro Gly Ile Gln Gly Ile Gly
305 310 315 320



Thr Val Leu Gly Val Ala Ile Gln Gln Arg Gln Val Pro Gly Leu Lys
325 330 335



Glu Phe Glu Glu Ser Tyr Val Gln Ala Val Met Gly Ala Pro Arg Thr
340 345 350



Cys Pro Glu Gly Ser Trp Cys Gly Thr Asn Gln Leu Cys Arg Glu Cys
355 360 365



His Ala Phe Thr Thr Trp Asn Met Pro Glu Leu Gly Ala Phe Ser Met 
370 375 380 



Ser Ala Ala Tyr Asn Val Tyr Glu Ala Val Tyr Ala Val Ala His Gly 
385 390 395 400 



Leu His Gln Leu Leu Gly Cys Thr Ser Gly Thr Cys Ala Arg Gly Pro
405 410 415



Val Tyr Pro Trp Gln Leu Leu Gln Gln Ile Tyr Lys Val Asn Phe Leu
420 425 430



Leu His Lys Lys Thr Val Ala Phe Asp Asp Lys Gly Asp Pro Leu Gly
435 440 445



Tyr Tyr Asp Ile Ile Ala Trp Asp Trp Asn Gly Pro Glu Trp Thr Phe
450 455 460



Glu Val Ile Gly Ser Ala Ser Leu Ser Pro Val His Leu Asp Ile Asn
465 470 475 480



Lys Thr Lys Ile Gln Trp His Gly Lys Asn Asn Gln Val Pro Val Ser
485 490 495



Val Cys Thr Arg Asp Cys Leu Glu Gly His His Arg Leu Val Met Gly 
500 505 510 



Ser His His Cys Cys Phe Glu Cys Met Pro Cys Glu Ala Gly Thr Phe 
515 520 525 



Leu Asn Thr Ser Glu Leu His Thr Cys Gln Pro Cys Gly Thr Glu Glu
530 535 540



Trp Ala Pro Glu Gly Ser Ser Ala Cys Phe Ser Arg Thr Val Glu Phe
545 550 555 560



Leu Gly Trp His Glu Pro Ile Ser Leu Val Leu Leu Ala Ala Asn Thr
565 570 575



Leu Leu Leu Leu Leu Leu Ile Gly Thr Ala Gly Leu Phe Ala Trp Arg
580 585 590



Leu His Thr Pro Val Val Arg Ser Ala Gly Gly Arg Leu Cys Phe Leu
595 600 605



Met Leu Gly Ser Leu Val Ala Gly Ser Cys Ser Leu Tyr Ser Phe Phe
610 615 620



Gly Lys Pro Thr Val Pro Ala Cys Leu Leu Arg Gln Pro Leu Phe Ser
625 630 635 640



Leu Gly Phe Ala Ile Phe Leu Ser Cys Leu Thr Ile Arg Ser Phe Gln
645 650 655



Leu Val Ile Ile Phe Lys Phe Ser Thr Lys Val Pro Thr Phe Tyr His
660 665 670



Thr Trp Ala Gln Asn His Gly Ala Gly Ile Phe Val Ile Val Ser Ser
675 680 685



Thr Val His Leu Phe Leu Cys Leu Thr Trp Leu Ala Met Trp Thr Pro
690 695 700



Arg Pro Thr Arg Glu Tyr Gln Arg Phe Pro His Leu Val Ile Leu Glu
705 710 715 720



Cys Thr Glu Val Asn Ser Val Gly Phe Leu Val Ala Phe Ala His Asn 
725 730 735 



Ile Leu Leu Ser Ile Ser Thr Phe Val Cys Ser Tyr Leu Gly Lys Glu
740 745 750



Leu Pro Glu Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Leu
755 760 765



Leu His Phe Val Ser Trp Ile Ala Phe Phe Thr Met Ser Ser Ile Tyr
770 775 780



Gln Gly Ser Tyr Leu Pro Ala Val Asn Val Leu Ala Gly Leu Ala Thr
785 790 795 800



Leu Ser Gly Gly Phe Ser Gly Tyr Phe Leu Pro Lys Cys Tyr Val Ile
805 810 815



Leu Cys Arg Pro Glu Leu Asn Asn Thr Glu His Phe Gln Ala Ser Ile
820 825 830



Gln Asp Tyr Thr Arg Arg Cys Gly Thr Thr
835 840 



<210> 16 
<211> 840 
<212> PRT 

<213> Rattus rattus 
<400> 16 

Met Leu Phe Trp Ala Ala His Leu Leu Leu Ser Leu Gln Leu Val Tyr
1 5 10 15



Cys Trp Ala Phe Ser Cys Gln Arg Thr Glu Ser Ser Pro Gly Phe Ser
20 25 30



Leu Pro Gly Asp Phe Leu Leu Ala Gly Leu Phe Ser Leu His Gly Asp
35 40 45



Cys Leu Gln Val Arg His Arg Pro Leu Val Thr Ser Cys Asp Arg Pro
50 55 60



Asp Ser Phe Asn Gly His Gly Tyr His Leu Phe Gln Ala Met Arg Phe
65 70 75 80



Thr Val Glu Glu Ile Asn Asn Ser Ser Ala Leu Leu Pro Asn Ile Thr
85 90 95



Leu Gly Tyr Glu Leu Tyr Asp Val Cys Ser Glu Ser Ala Asn Val Tyr
100 105 110



Ala Thr Leu Arg Val Leu Ala Leu Gln Gly Pro Arg His Ile Glu Ile
115 120 125



Gln Lys Asp Leu Arg Asn His Ser Ser Lys Val Val Ala Phe Ile Gly
130 135 140



Pro Asp Asn Thr Asp His Ala Val Thr Thr Ala Ala Leu Leu Gly Pro
145 150 155 160



Phe Leu Met Pro Leu Val Ser Tyr Glu Ala Ser Ser Val Val Leu Ser 
165 170 175 



Ala Lys Arg Lys Phe Pro Ser Phe Leu Arg Thr Val Pro Ser Asp Arg
180 185 190



His Gln Val Glu Val Met Val Gln Leu Leu Gln Ser Phe Gly Trp Val
195 200 205



Trp Ile Ser Leu Ile Gly Ser Tyr Gly Asp Tyr Gly Gln Leu Gly Val
210 215 220



Gln Ala Leu Glu Glu Leu Ala Val Pro Arg Gly Ile Cys Val Ala Phe
225 230 235 240



Lys Asp Ile Val Pro Phe Ser Ala Arg Val Gly Asp Pro Arg Met Gln
245 250 255



Ser Met Met Gln His Leu Ala Gln Ala Arg Thr Thr Val Val Val Val
260 265 270



Phe Ser Asn Arg His Leu Ala Arg Val Phe Phe Arg Ser Val Val Leu
275 280 285



Ala Asn Leu Thr Gly Lys Val Trp Val Ala Ser Glu Asp Trp Ala Ile
290 295 300



Ser Thr Tyr Ile Thr Ser Val Thr Gly Ile Gln Gly Ile Gly Thr Val
305 310 315 320



Leu Gly Val Ala Val Gln Gln Arg Gln Val Pro Gly Leu Lys Glu Phe
325 330 335



Glu Glu Ser Tyr Val Arg Ala Val Thr Ala Ala Pro Ser Ala Cys Pro 
340 345 350 



Glu Gly Ser Trp Cys Ser Thr Asn Gln Leu Cys Arg Glu Cys His Thr
355 360 365



Phe Thr Thr Arg Asn Met Pro Thr Leu Gly Ala Phe Ser Met Ser Ala
370 375 380



Ala Tyr Arg Val Tyr Glu Ala Val Tyr Ala Val Ala His Gly Leu His
385 390 395 400



Gln Leu Leu Gly Cys Thr Ser Glu Ile Cys Ser Arg Gly Pro Val Tyr
405 410 415



Pro Trp Gln Leu Leu Gln Gln Ile Tyr Lys Val Asn Phe Leu Leu His
420 425 430



Glu Asn Thr Val Ala Phe Asp Asp Asn Gly Asp Thr Leu Gly Tyr Tyr
435 440 445



Asp Ile Ile Ala Trp Asp Trp Asn Gly Pro Glu Trp Thr Phe Glu Ile
450 455 460



Ile Gly Ser Ala Ser Leu Ser Pro Val His Leu Asp Ile Asn Lys Thr
465 470 475 480



Lys Ile Gln Trp His Gly Lys Asn Asn Gln Val Pro Val Ser Val Cys
485 490 495



Thr Thr Asp Cys Leu Ala Gly His His Arg Val Val Val Gly Ser His 
500 505 510 



His Cys Cys Phe Glu Cys Val Pro Cys Glu Ala Gly Thr Phe Leu Asn 
515 520 525 



Met Ser Glu Leu His Ile Cys Gln Pro Cys Gly Thr Glu Glu Trp Ala
530 535 540



Pro Lys Glu Ser Thr Thr Cys Phe Pro Arg Thr Val Glu Phe Leu Ala
545 550 555 560



Trp His Glu Pro Ile Ser Leu Val Leu Ile Ala Ala Asn Thr Leu Leu
565 570 575



Leu Leu Leu Leu Val Gly Thr Ala Gly Leu Phe Ala Trp His Phe His
580 585 590



Thr Pro Val Val Arg Ser Ala Gly Gly Arg Leu Cys Phe Leu Met Leu 
595 600 605 



Gly Ser Leu Val Ala Gly Ser Cys Ser Phe Tyr Ser Phe Phe Gly Glu 
610 615 620 



Pro Thr Val Pro Ala Cys Leu Leu Arg Gln Pro Leu Phe Ser Leu Gly
625 630 635 640



Phe Ala Ile Phe Leu Ser Cys Leu Thr Ile Arg Ser Phe Gln Leu Val
645 650 655



Ile Ile Phe Lys Phe Ser Thr Lys Val Pro Thr Phe Tyr Arg Thr Trp
660 665 670



Ala Gln Asn His Gly Ala Gly Leu Phe Val Ile Val Ser Ser Thr Val
675 680 685



His Leu Leu Ile Cys Leu Thr Trp Leu Val Met Trp Thr Pro Arg Pro
690 695 700



Thr Arg Glu Tyr Gln Arg Phe Pro His Leu Val Ile Leu Glu Cys Thr
705 710 715 720



Glu Val Asn Ser Val Gly Phe Leu Leu Ala Phe Thr His Asn Ile Leu
725 730 735



Leu Ser Ile Ser Thr Phe Val Cys Ser Tyr Leu Gly Lys Glu Leu Pro
740 745 750



Glu Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Leu Leu Asn
755 760 765



Phe Val Ser Trp Ile Ala Phe Phe Thr Met Ala Ser Ile Tyr Gln Gly
770 775 780



Ser Tyr Leu Pro Ala Val Asn Val Leu Ala Gly Leu Thr Thr Leu Ser
785 790 795 800



Gly Gly Phe Ser Gly Tyr Phe Leu Pro Lys Cys Tyr Val Ile Leu Cys
805 810 815



Arg Pro Glu Leu Asn Asn Thr Glu His Phe Gln Ala Ser Ile Gln Asp
820 825 830



Tyr Thr Arg Arg Cys Gly Thr Thr 
835 840 



<210> 17 

<211> 841 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Leu Leu Cys Thr Ala Arg Leu Val Gly Leu Gln Leu Leu Ile Ser
1 5 10 15



Cys Cys Trp Ala Phe Ala Cys His Ser Thr Glu Ser Ser Pro Asp Phe
20 25 30



Thr Leu Pro Gly Asp Tyr Leu Leu Ala Gly Leu Phe Pro Leu His Ser
35 40 45



Gly Cys Leu Gln Val Arg His Arg Pro Glu Val Thr Leu Cys Asp Arg
50 55 60



Ser Cys Ser Phe Asn Glu His Gly Tyr His Leu Phe Gln Ala Met Arg
65 70 75 80



Leu Gly Val Glu Glu Ile Asn Asn Ser Thr Ala Leu Leu Pro Asn Ile
85 90 95



Thr Leu Gly Tyr Gln Leu Tyr Asp Val Cys Ser Asp Ser Ala Asn Val
100 105 110



Tyr Ala Thr Leu Arg Val Leu Ser Leu Pro Gly Gln His His Ile Glu
115 120 125



Leu Gln Gly Asp Leu Leu His Tyr Ser Pro Thr Val Leu Ala Val Ile
130 135 140



Gly Pro Asp Ser Thr Asn Arg Ala Ala Thr Thr Ala Ala Leu Leu Ser
145 150 155 160



Pro Phe Leu Val Pro Met Ile Ser Tyr Ala Ala Ser Ser Glu Thr Leu
165 170 175



Ser Val Lys Arg Gln Tyr Pro Ser Phe Leu Arg Thr Ile Pro Asn Asp
180 185 190



Lys Tyr Gln Val Glu Thr Met Val Leu Leu Leu Gln Lys Phe Gly Trp
195 200 205



Thr Trp Ile Ser Leu Val Gly Ser Ser Asp Asp Tyr Gly Gln Leu Gly
210 215 220



Val Gln Ala Leu Glu Asn Gln Ala Thr Gly Gln Gly Ile Cys Ile Ala
225 230 235 240



Phe Lys Asp Ile Met Pro Phe Ser Ala Gln Val Gly Asp Glu Arg Met
245 250 255



Gln Cys Leu Met Arg His Leu Ala Gln Ala Gly Ala Thr Val Val Val
260 265 270



Val Phe Ser Ser Arg Gln Leu Ala Arg Val Phe Phe Glu Ser Val Val
275 280 285



Leu Thr Asn Leu Thr Gly Lys Val Trp Val Ala Ser Glu Ala Trp Ala 
290 295 300 



Leu Ser Arg His Ile Thr Gly Val Pro Gly Ile Gln Arg Ile Gly Met
305 310 315 320



Val Leu Gly Val Ala Ile Gln Lys Arg Ala Val Pro Gly Leu Lys Ala
325 330 335



Phe Glu Glu Ala Tyr Ala Arg Ala Asp Lys Lys Ala Pro Arg Pro Cys
340 345 350



His Lys Gly Ser Trp Cys Ser Ser Asn Gln Leu Cys Arg Glu Cys Gln
355 360 365



Ala Phe Met Ala His Thr Met Pro Lys Leu Lys Ala Phe Ser Met Ser
370 375 380



Ser Ala Tyr Asn Ala Tyr Arg Ala Val Tyr Ala Val Ala His Gly Leu
385 390 395 400



His Gln Leu Leu Gly Cys Ala Ser Gly Ala Cys Ser Arg Gly Arg Val
405 410 415



Tyr Pro Trp Gln Leu Leu Glu Gln Ile His Lys Val His Phe Leu Leu
420 425 430



His Lys Asp Thr Val Ala Phe Asn Asp Asn Arg Asp Pro Leu Ser Ser
435 440 445



Tyr Asn Ile Ile Ala Trp Asp Trp Asn Gly Pro Lys Trp Thr Phe Thr
450 455 460



Val Leu Gly Ser Ser Thr Trp Ser Pro Val Gln Leu Asn Ile Asn Glu
465 470 475 480



Thr Lys Ile Gln Trp His Gly Lys Asp Asn Gln Val Pro Lys Ser Val
485 490 495



Cys Ser Ser Asp Cys Leu Glu Gly His Gln Arg Val Val Thr Gly Phe
500 505 510



His His Cys Cys Phe Glu Cys Val Pro Cys Gly Ala Gly Thr Phe Leu
515 520 525



Asn Lys Ser Asp Leu Tyr Arg Cys Gln Pro Cys Gly Lys Glu Glu Trp
530 535 540



Ala Pro Glu Gly Ser Gln Thr Cys Phe Pro Arg Thr Val Val Phe Leu
545 550 555 560



Ala Leu Arg Glu His Thr Ser Trp Val Leu Leu Ala Ala Asn Thr Leu 
565 570 575 



Leu Leu Leu Leu Leu Leu Gly Thr Ala Gly Leu Phe Ala Trp His Leu 
580 585 590 



Asp Thr Pro Val Val Arg Ser Ala Gly Gly Arg Leu Cys Phe Leu Met 
595 600 605 



Leu Gly Ser Leu Ala Ala Gly Ser Gly Ser Leu Tyr Gly Phe Phe Gly
610 615 620



Glu Pro Thr Arg Pro Ala Cys Leu Leu Arg Gln Ala Leu Phe Ala Leu
625 630 635 640



Gly Phe Thr Ile Phe Leu Ser Cys Leu Thr Val Arg Ser Phe Gln Leu
645 650 655



Ile Ile Ile Phe Lys Phe Ser Thr Lys Val Pro Thr Phe Tyr His Ala
660 665 670



Trp Val Gln Asn His Gly Ala Gly Leu Phe Val Met Ile Ser Ser Ala
675 680 685



Ala Gln Leu Leu Ile Cys Leu Thr Trp Leu Val Val Trp Thr Pro Leu
690 695 700



Pro Ala Arg Glu Tyr Gln Arg Phe Pro His Leu Val Met Leu Glu Cys
705 710 715 720



Thr Glu Thr Asn Ser Leu Gly Phe Ile Leu Ala Phe Leu Tyr Asn Gly
725 730 735



Leu Leu Ser Ile Ser Ala Phe Ala Cys Ser Tyr Leu Gly Lys Asp Leu
740 745 750



Pro Glu Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Leu Phe
755 760 765



Asn Phe Val Ser Trp Ile Ala Phe Phe Thr Thr Ala Ser Val Tyr Asp
770 775 780



Gly Lys Tyr Leu Pro Ala Ala Asn Met Met Ala Gly Leu Ser Ser Leu
785 790 795 800



Ser Ser Gly Phe Gly Gly Tyr Phe Leu Pro Lys Cys Tyr Val Ile Leu
805 810 815



Cys Arg Pro Asp Leu Asn Ser Thr Glu His Phe Gln Ala Ser Ile Gln
820 825 830



Asp Tyr Thr Arg Arg Cys Gly Ser Thr 
835 840 



<210> 18 

<211> 843 

<212> PRT 

<213> Mus musculus 

<400> 18 

Met Gly Pro Gln Ala Arg Thr Leu His Leu Leu Phe Leu Leu Leu His
1 5 10 15



Ala Leu Pro Lys Pro Val Met Leu Val Gly Asn Ser Asp Phe His Leu
20 25 30



Ala Gly Asp Tyr Leu Leu Gly Gly Leu Phe Thr Leu His Ala Asn Val
35 40 45



Lys Ser Val Ser His Leu Ser Tyr Leu Gln Val Pro Lys Cys Asn Glu
50 55 60



Tyr Asn Met Lys Val Leu Gly Tyr Asn Leu Met Gln Ala Met Arg Phe
65 70 75 80



Ala Val Glu Glu Ile Asn Asn Cys Ser Ser Leu Leu Pro Gly Val Leu
85 90 95



Leu Gly Tyr Glu Met Val Asp Val Cys Tyr Leu Ser Asn Asn Ile Gln
100 105 110



Pro Gly Leu Tyr Phe Leu Ser Gln Ile Asp Asp Phe Leu Pro Ile Leu
115 120 125



Lys Asp Tyr Ser Gln Tyr Arg Pro Gln Val Val Ala Val Ile Gly Pro
130 135 140



Asp Asn Ser Glu Ser Ala Ile Thr Val Ser Asn Ile Leu Ser Tyr Phe
145 150 155 160



Leu Val Pro Gln Val Thr Tyr Ser Ala Ile Thr Asp Lys Leu Arg Asp
165 170 175



Lys Arg Arg Phe Pro Ala Met Leu Arg Thr Val Pro Ser Ala Thr His
180 185 190



His Ile Glu Ala Met Val Gln Leu Met Val His Phe Gln Trp Asn Trp
195 200 205



Ile Val Val Leu Val Ser Asp Asp Asp Tyr Gly Arg Glu Asn Ser His
210 215 220



Leu Leu Ser Gln Arg Leu Thr Asn Thr Gly Asp Ile Cys Ile Ala Phe
225 230 235 240



Gln Glu Val Leu Pro Val Pro Glu Pro Asn Gln Ala Val Arg Pro Glu
245 250 255



Glu Gln Asp Gln Leu Asp Asn Ile Leu Asp Lys Leu Arg Arg Thr Ser
260 265 270



Ala Arg Val Val Val Ile Phe Ser Pro Glu Leu Ser Leu His Asn Phe
275 280 285



Phe Arg Glu Val Leu Arg Trp Asn Phe Thr Gly Phe Val Trp Ile Ala
290 295 300



Ser Glu Ser Trp Ala Ile Asp Pro Val Leu His Asn Leu Thr Glu Leu
305 310 315 320



Arg His Thr Gly Thr Phe Leu Gly Val Thr Ile Gln Arg Val Ser Ile
325 330 335



Pro Gly Phe Ser Gln Phe Arg Val Arg His Asp Lys Pro Glu Tyr Pro
340 345 350



Met Pro Asn Glu Thr Ser Leu Arg Thr Thr Cys Asn Gln Asp Cys Asp
355 360 365



Ala Cys Met Asn Ile Thr Glu Ser Phe Asn Asn Val Leu Met Leu Ser
370 375 380



Gly Glu Arg Val Val Tyr Ser Val Tyr Ser Ala Val Tyr Ala Val Ala
385 390 395 400



His Thr Leu His Arg Leu Leu His Cys Asn Gln Val Arg Cys Thr Lys
405 410 415



Gln Ile Val Tyr Pro Trp Gln Leu Leu Arg Glu Ile Trp His Val Asn
420 425 430



Phe Thr Leu Leu Gly Asn Gln Leu Phe Phe Asp Glu Gln Gly Asp Met
435 440 445



Pro Met Leu Leu Asp Ile Ile Gln Trp Gln Trp Gly Leu Ser Gln Asn
450 455 460



Pro Phe Gln Ser Ile Ala Ser Tyr Ser Pro Thr Glu Thr Arg Leu Thr
465 470 475 480



Tyr Ile Ser Asn Val Ser Trp Tyr Thr Pro Asn Asn Thr Val Pro Ile
485 490 495



Ser Met Cys Ser Lys Ser Cys Gln Pro Gly Gln Met Lys Lys Pro Ile
500 505 510



Gly Leu His Pro Cys Cys Phe Glu Cys Val Asp Cys Pro Pro Gly Thr
515 520 525



Tyr Leu Asn Arg Ser Val Asp Glu Phe Asn Cys Leu Ser Cys Pro Gly 
530 535 540 



Ser Met Trp Ser Tyr Lys Asn Asn Ile Ala Cys Phe Lys Arg Arg Leu
545 550 555 560



Ala Phe Leu Glu Trp His Glu Val Pro Thr Ile Val Val Thr Ile Leu
565 570 575



Ala Ala Leu Gly Phe Ile Ser Thr Leu Ala Ile Leu Leu Ile Phe Trp
580 585 590



Arg His Phe Gln Thr Pro Met Val Arg Ser Ala Gly Gly Pro Met Cys
595 600 605



Phe Leu Met Leu Val Pro Leu Leu Leu Ala Phe Gly Met Val Pro Val
610 615 620



Tyr Val Gly Pro Pro Thr Val Phe Ser Cys Phe Cys Arg Gln Ala Phe
625 630 635 640



Phe Thr Val Cys Phe Ser Val Cys Leu Ser Cys Ile Thr Val Arg Ser
645 650 655



Phe Gln Ile Val Cys Val Phe Lys Met Ala Arg Arg Leu Pro Ser Ala
660 665 670



Tyr Gly Phe Trp Met Arg Tyr His Gly Pro Tyr Val Phe Val Ala Phe
675 680 685



Ile Thr Ala Val Lys Val Ala Leu Val Ala Gly Asn Met Leu Ala Thr
690 695 700



Thr Ile Asn Pro Ile Gly Arg Thr Asp Pro Asp Asp Pro Asn Ile Ile
705 710 715 720



Ile Leu Ser Cys His Pro Asn Tyr Arg Asn Gly Leu Leu Phe Asn Thr
725 730 735



Ser Met Asp Leu Leu Leu Ser Val Leu Gly Phe Ser Phe Ala Tyr Val
740 745 750



Gly Lys Glu Leu Pro Thr Asn Tyr Asn Glu Ala Lys Phe Ile Thr Leu
755 760 765



Ser Met Thr Phe Ser Phe Thr Ser Ser Ile Ser Leu Cys Thr Phe Met
770 775 780



Ser Val His Asp Gly Val Leu Val Thr Ile Met Asp Leu Leu Val Thr
785 790 795 800



Val Leu Asn Phe Leu Ala Ile Gly Leu Gly Tyr Phe Gly Pro Lys Cys
805 810 815



Tyr Met Ile Leu Phe Tyr Pro Glu Arg Asn Thr Ser Ala Tyr Phe Asn
820 825 830



Ser Met Ile Gln Gly Tyr Thr Met Arg Lys Ser
835 840 



<210> 19 
<211> 843 
<212> PRT 

<213> Rattus rattus 
<400> 19 

Met Gly Pro Gln Ala Arg Thr Leu Cys Leu Leu Ser Leu Leu Leu His
1 5 10 15



Val Leu Pro Lys Pro Gly Lys Leu Val Glu Asn Ser Asp Phe His Leu
20 25 30



Ala Gly Asp Tyr Leu Leu Gly Gly Leu Phe Thr Leu His Ala Asn Val
35 40 45



Lys Ser Ile Ser His Leu Ser Tyr Leu Gln Val Pro Lys Cys Asn Glu
50 55 60



Phe Thr Met Lys Val Leu Gly Tyr Asn Leu Met Gln Ala Met Arg Phe
65 70 75 80



Ala Val Glu Glu Ile Asn Asn Cys Ser Ser Leu Leu Pro Gly Val Leu
85 90 95



Leu Gly Tyr Glu Met Val Asp Val Cys Tyr Leu Ser Asn Asn Ile His
100 105 110



Pro Gly Leu Tyr Phe Leu Ala Gln Asp Asp Asp Leu Leu Pro Ile Leu
115 120 125



Lys Asp Tyr Ser Gln Tyr Met Pro His Val Val Ala Val Ile Gly Pro
130 135 140



Asp Asn Ser Glu Ser Ala Ile Thr Val Ser Asn Ile Leu Ser His Phe
145 150 155 160



Leu Ile Pro Gln Ile Thr Tyr Ser Ala Ile Ser Asp Lys Leu Arg Asp
165 170 175



Lys Arg His Phe Pro Ser Met Leu Arg Thr Val Pro Ser Ala Thr His 
180 185 190 



His Ile Glu Ala Met Val Gln Leu Met Val His Phe Gln Trp Asn Trp
195 200 205



Ile Val Val Leu Val Ser Asp Asp Asp Tyr Gly Arg Glu Asn Ser His
210 215 220



Leu Leu Ser Gln Arg Leu Thr Lys Thr Ser Asp Ile Cys Ile Ala Phe
225 230 235 240



Gln Glu Val Leu Pro Ile Pro Glu Ser Ser Gln Val Met Arg Ser Glu
245 250 255



Glu Gln Arg Gln Leu Asp Asn Ile Leu Asp Lys Leu Arg Arg Thr Ser
260 265 270



Ala Arg Val Val Val Val Phe Ser Pro Glu Leu Ser Leu Tyr Ser Phe
275 280 285



Phe His Glu Val Leu Arg Trp Asn Phe Thr Gly Phe Val Trp Ile Ala
290 295 300



Ser Glu Ser Trp Ala Ile Asp Pro Val Leu His Asn Leu Thr Glu Leu
305 310 315 320



Arg His Thr Gly Thr Phe Leu Gly Val Thr Ile Gln Arg Val Ser Ile
325 330 335



Pro Gly Phe Ser Gln Phe Arg Val Arg Arg Asp Lys Pro Gly Tyr Pro
340 345 350



Val Pro Asn Thr Thr Asn Leu Arg Thr Thr Cys Asn Gln Asp Cys Asp
355 360 365



Ala Cys Leu Asn Thr Thr Lys Ser Phe Asn Asn Ile Leu Ile Leu Ser
370 375 380



Gly Glu Arg Val Val Tyr Ser Val Tyr Ser Ala Val Tyr Ala Val Ala
385 390 395 400



His Ala Leu His Arg Leu Leu Gly Cys Asn Arg Val Arg Cys Thr Lys
405 410 415



Gln Lys Val Tyr Pro Trp Gln Leu Leu Arg Glu Ile Trp His Val Asn
420 425 430



Phe Thr Leu Leu Gly Asn Arg Leu Phe Phe Asp Gln Gln Gly Asp Met
435 440 445



Pro Met Leu Leu Asp Ile Ile Gln Trp Gln Trp Asp Leu Ser Gln Asn
450 455 460



Pro Phe Gln Ser Ile Ala Ser Tyr Ser Pro Thr Ser Lys Arg Leu Thr
465 470 475 480



Tyr Ile Asn Asn Val Ser Trp Tyr Thr Pro Asn Asn Thr Val Pro Val
485 490 495



Ser Met Cys Ser Lys Ser Cys Gln Pro Gly Gln Met Lys Lys Ser Val
500 505 510



Gly Leu His Pro Cys Cys Phe Glu Cys Leu Asp Cys Met Pro Gly Thr 
515 520 525 



Tyr Leu Asn Arg Ser Ala Asp Glu Phe Asn Cys Leu Ser Cys Pro Gly 
530 535 540 



Ser Met Trp Ser Tyr Lys Asn Asp Ile Thr Cys Phe Gln Arg Arg Pro
545 550 555 560



Thr Phe Leu Glu Trp His Glu Val Pro Thr Ile Val Val Ala Ile Leu
565 570 575



Ala Ala Leu Gly Phe Phe Ser Thr Leu Ala Ile Leu Phe Ile Phe Trp
580 585 590



Arg His Phe Gln Thr Pro Met Val Arg Ser Ala Gly Gly Pro Met Cys
595 600 605



Phe Leu Met Leu Val Pro Leu Leu Leu Ala Phe Gly Met Val Pro Val
610 615 620



Tyr Val Gly Pro Pro Thr Val Phe Ser Cys Phe Cys Arg Gln Ala Phe
625 630 635 640



Phe Thr Val Cys Phe Ser Ile Cys Leu Ser Cys Ile Thr Val Arg Ser
645 650 655



Phe Gln Ile Val Cys Val Phe Lys Met Ala Arg Arg Leu Pro Ser Ala
660 665 670



Tyr Ser Phe Trp Met Arg Tyr His Gly Pro Tyr Val Phe Val Ala Phe 
675 680 685 



Ile Thr Ala Ile Lys Val Ala Leu Val Val Gly Asn Met Leu Ala Thr
690 695 700



Thr Ile Asn Pro Ile Gly Arg Thr Asp Pro Asp Asp Pro Asn Ile Met
705 710 715 720



Ile Leu Ser Cys His Pro Asn Tyr Arg Asn Gly Leu Leu Phe Asn Thr
725 730 735



Ser Met Asp Leu Leu Leu Ser Val Leu Gly Phe Ser Phe Ala Tyr Met
740 745 750



Gly Lys Glu Leu Pro Thr Asn Tyr Asn Glu Ala Lys Phe Ile Thr Leu
755 760 765



Ser Met Thr Phe Ser Phe Thr Ser Ser Ile Ser Leu Cys Thr Phe Met
770 775 780



Ser Val His Asp Gly Val Leu Val Thr Ile Met Asp Leu Leu Val Thr
785 790 795 800



Val Leu Asn Phe Leu Ala Ile Gly Leu Gly Tyr Phe Gly Pro Lys Cys
805 810 815



Tyr Met Ile Leu Phe Tyr Pro Glu Arg Asn Thr Ser Ala Tyr Phe Asn
820 825 830



Ser Met Ile Gln Gly Tyr Thr Met Arg Lys Ser
835 840


<210> 20
<211> 839
<212> PRT
<213> Homo
<400> 20

Met Gly Pro Arg Ala Lys Thr Ile Cys Ser Leu Phe Phe Leu Leu Trp
1 5 10 15



Val Leu Ala Glu Pro Ala Glu Asn Ser Asp Phe Tyr Leu Pro Gly Asp 
20 25 30 



Tyr Leu Leu Gly Gly Leu Phe Ser Leu His Ala Asn Met Lys Gly Ile
35 40 45



Val His Leu Asn Phe Leu Gln Val Pro Met Cys Lys Glu Tyr Glu Val
50 55 60



Lys Val Ile Gly Tyr Asn Leu Met Gln Ala Met Arg Phe Ala Val Glu
65 70 75 80



Glu Ile Asn Asn Asp Ser Ser Leu Leu Pro Gly Val Leu Leu Gly Tyr
85 90 95



Glu Ile Val Asp Val Cys Tyr Ile Ser Asn Asn Val Gln Pro Val Leu
100 105 110



Tyr Phe Leu Ala His Glu Asp Asn Leu Leu Pro Ile Gln Glu Asp Tyr
115 120 125



Ser Asn Tyr Ile Ser Arg Val Val Ala Val Ile Gly Pro Asp Asn Ser
130 135 140



Glu Ser Val Met Thr Val Ala Asn Phe Leu Ser Leu Phe Leu Leu Pro 
145 150 155 160 



Gln Ile Thr Tyr Ser Ala Ile Ser Asp Glu Leu Arg Asp Lys Val Arg
165 170 175



Phe Pro Ala Leu Leu Arg Thr Thr Pro Ser Ala Asp His His Val Glu
180 185 190



Ala Met Val Gln Leu Met Leu His Phe Arg Trp Asn Trp Ile Ile Val
195 200 205



Leu Val Ser Ser Asp Thr Tyr Gly Arg Asp Asn Gly Gln Leu Leu Gly
210 215 220



Glu Arg Val Ala Arg Arg Asp Ile Cys Ile Ala Phe Gln Glu Thr Leu
225 230 235 240


Pro Thr Leu Gln Pro Asn Gln Asn Met Thr Ser Glu Glu Arg Gln Arg
245 250 255



Leu Val Thr Ile Val Asp Lys Leu Gln Gln Ser Thr Ala Arg Val Val
260 265 270



Val Val Phe Ser Pro Asp Leu Thr Leu Tyr His Phe Phe Asn Glu Val
275 280 285



Leu Arg Gln Asn Phe Thr Gly Ala Val Trp Ile Ala Ser Glu Ser Trp
290 295 300



Ala Ile Asp Pro Val Leu His Asn Leu Thr Glu Leu Gly His Leu Gly
305 310 315 320



Thr Phe Leu Gly Ile Thr Ile Gln Ser Val Pro Ile Pro Gly Phe Ser
325 330 335



Glu Phe Arg Glu Trp Gly Pro Gln Ala Gly Pro Pro Pro Leu Ser Arg
340 345 350



Thr Ser Gln Ser Tyr Thr Cys Asn Gln Glu Cys Asp Asn Cys Leu Asn
355 360 365



Ala Thr Leu Ser Phe Asn Thr Ile Leu Arg Leu Ser Gly Glu Arg Val
370 375 380



Val Tyr Ser Val Tyr Ser Ala Val Tyr Ala Val Ala His Ala Leu His
385 390 395 400



Ser Leu Leu Gly Cys Asp Lys Ser Thr Cys Thr Lys Arg Val Val Tyr
405 410 415



Pro Trp Gln Leu Leu Glu Glu Ile Trp Lys Val Asn Phe Thr Leu Leu
420 425 430



Asp His Gln Ile Phe Phe Asp Pro Gln Gly Asp Val Ala Leu His Leu
435 440 445



Glu Ile Val Gln Trp Gln Trp Asp Arg Ser Gln Asn Pro Phe Gln Ser
450 455 460



Val Ala Ser Tyr Tyr Pro Leu Gln Arg Gln Leu Lys Asn Ile Gln Asp
465 470 475 480



Ile Ser Trp His Thr Val Asn Asn Thr Ile Pro Met Ser Met Cys Ser
485 490 495



Lys Arg Cys Gln Ser Gly Gln Lys Lys Lys Pro Val Gly Ile His Val
500 505 510



Cys Cys Phe Glu Cys Ile Asp Cys Leu Pro Gly Thr Phe Leu Asn His
515 520 525



Thr Glu Asp Glu Tyr Glu Cys Gln Ala Cys Pro Asn Asn Glu Trp Ser
530 535 540



Tyr Gln Ser Glu Thr Ser Cys Phe Lys Arg Gln Leu Val Phe Leu Glu
545 550 555 560



Trp His Glu Ala Pro Thr Ile Ala Val Ala Leu Leu Ala Ala Leu Gly
565 570 575



Phe Leu Ser Thr Leu Ala Ile Leu Val Ile Phe Trp Arg His Phe Gln
580 585 590



Thr Pro Ile Val Arg Ser Ala Gly Gly Pro Met Cys Phe Leu Met Leu
595 600 605



Thr Leu Leu Leu Val Ala Tyr Met Val Val Pro Val Tyr Val Gly Pro
610 615 620



Pro Lys Val Ser Thr Cys Leu Cys Arg Gln Ala Leu Phe Pro Leu Cys
625 630 635 640



Phe Thr Ile Cys Ile Ser Cys Ile Ala Val Arg Ser Phe Gln Ile Val
645 650 655



Cys Ala Phe Lys Met Ala Ser Arg Phe Pro Arg Ala Tyr Ser Tyr Trp
660 665 670



Val Arg Tyr Gln Gly Pro Tyr Val Ser Met Ala Phe Ile Thr Val Leu
675 680 685



Lys Met Val Ile Val Val Ile Gly Met Leu Ala Thr Gly Leu Ser Pro
690 695 700



Thr Thr Arg Thr Asp Pro Asp Asp Pro Lys Ile Thr Ile Val Ser Cys
705 710 715 720



Asn Pro Asn Tyr Arg Asn Ser Leu Leu Phe Asn Thr Ser Leu Asp Leu 
725 730 735 



Leu Leu Ser Val Val Gly Phe Ser Phe Ala Tyr Met Gly Lys Glu Leu 
740 745 750 



Pro Thr Asn Tyr Asn Glu Ala Lys Phe Ile Thr Leu Ser Met Thr Phe
755 760 765



Tyr Phe Thr Ser Ser Val Ser Leu Cys Thr Phe Met Ser Ala Tyr Ser
770 775 780



Gly Val Leu Val Thr Ile Val Asp Leu Leu Val Thr Val Leu Asn Leu
785 790 795 800



Leu Ala Ile Ser Leu Gly Tyr Phe Gly Pro Lys Cys Tyr Met Ile Leu
805 810 815



Phe Tyr Pro Glu Arg Asn Thr Pro Ala Tyr Phe Asn Ser Met Ile Gln
820 825 830



Gly Tyr Thr Met Arg Arg Asp 
835 
<210> 21
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 21

actttgagaa catgagtaat gacg 24



<210> 22
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 22

agtacccgga ctgcgtcgtc atta 24



<210> 23
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 23

cactagggtc atccttgctt tcag 24



<210> 24
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 24

agtcagggtg atgggcctga aagc 24



<210> 25
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 25

atgtggtgga ctggctgtac catc 24



<210> 26
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 26

ttgaagccct ccacgtgatg gtac 24



<210> 27 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Overgo probes 

<400> 27 

cacacggtga acaagatcac cttc 24 



<210> 28
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 28

agtagcactg ctcggagaag gtga 24



<210> 29
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 29

atctaccaca tggacgagga ggag 24



<210> 30
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 30

tgaccaggta cggcgtctcc tcct 24



<210> 31
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 31

agcgcgtcac gctggccgac ttca 24



<210> 32
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 32

ttgctgagca cgttcttgaa gtcg 24


<210> 33
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 33

cacgcctaca aattcttctt taag 24


<210> 34
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 34

agtcctggtc catggactta aaga 24


<210> 35
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 35

cttccactcc tgctgctacg actg 24



<210> 36
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 36

tgcctcgcag tccacgcagt cgta 24


<210> 37
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 37

aggtgcgccg cgtcaagggc ttcc 24


<210> 38
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 38

tcgtagcagc aggagtggaa gccc 24


<210> 39
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 39

gttcctggca tggggggagc cggc 24



<210> 40
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 40

gagcagcaca agcacagccg gctc 24


<210> 41
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 41

acagcccact agttcaggcc gcag 24


<210> 42
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 42

caggcccggg gtccccctgc ggcc 24


<210> 43
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 43

cccactggtt caggcctcgg gggg 24


<210> 44
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 44

aaagcaggcc aggggccccc ccga 24



<210> 45
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Overgo probes
<400> 45

aggcgctggt gcactgccgc acac 24



<210> 


46 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


46 



aagctgaccc aggagcgtgt gcgg 24 



<210> 


47 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


47 



acagaggcac tggtgcactg ccgc 24 



<210> 


48 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


48 



tgatccagga gtgcacgcgg cagt 24 



<210> 


49 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


49 



accaatgcca cgctggcctt tctc 24 
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<210> 


50 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


50 



aagtgcccag gaagcagaga aagg 24 



<210> 


51 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


51 



tggtacatgc tgccaatgcc acgc 24 



<210> 


52 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


52 



aagcagagga aagccagcgt ggca 24 



<210> 


53 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


53 



tacaaccgtg cccgtggcct cacc 24 



<210> 


54 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


54 



aggccagcat ggcgaaggtg aggc 24 



<210> 


55 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


55 



tcatcacctg ggtctccttt gtgc 24 
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<210> 


56 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


56 



acattggcca ggaggggcac aaag 24 



<210> 


57 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


57 



tgcagatggg tgccctcctg ctct 24 



<210> 


58 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Overgo probes 


<400> 


58 



aggatgccca gcacacagag cagg 24 



<210> 59 

<211> 9049 

<212> DNA 

<213> Felis catus 



<220>
<221> misc_feature
<222> (14)..(14)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (47)..(47)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (56)..(56)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (67)..(67)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2113)..(2113)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2121)..(2121)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2125)..(2132)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2138)..(2138)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4198)..(4198)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4232)..(4232)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4237)..(4237)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4256)..(4256)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4264)..(4264)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4272)..(4272)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4298)..(4298)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4328)..(4328)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4341)..(4341)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4343)..(4343)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4354)..(4354)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4386)..(4386)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4389)..(4390)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (4399)..(4456)
<223> n is a, c, g, or t

<400> 59

ctggaaaaaa aggngaaccc aggatgattc accccaaaat ttcagtntca gaaaantgag 60

gactggnagg aggtcaactt aaagtcagtt tcatttggta aactgaggcc caggtaaaaa 120

gttctaaaac ccacagctcc cttccatatt ctgtccccca gagaagcagt gtccctgcct 180








CtC\C 4 CCCCf' t" t" 


c 1~ cf ^ etc c ctrtcf 




24 0 


ggcaccagag cgagaacaga acccacaacc atccagaggg aggggcagcg gccaccacct 300


cTcrc t~ 1" err s cc 


tgtgcctt ca 


ccctgcccag 


ttcc tgagt a 


ggacegcagg 


cccggaaggc 


360 


o d o.y u *■ — ci uu - 


f{ nrrf <ra"t~ 1" c 


ct acgactgg 


gt tccagccc 


cacccct ggc 


acaggcgtga 


420 






accactatct 


attctattta 


aacaacccraa 

a. n v_*- a. \^ \^ \_» \^ 


ctaatcaaaa 


480 


rrrrt' nrt rrrrrt' 


y y v — n ^ ^ ci 


ggcacaggac 


ggact ggeca 


gcatgtcact 


cccggcggct 


540 


cacctggtcg gcctgcagct ctccctctcc tgctgctggg ctctcagctg ccacagcaca 600

gagacgtctg ccgacttcag cctccctggg gattacctcc tcgcaggtct gttccctctg 660

cactctgact gtccgggcgt gaggcaccgg cccacggtga ccctctgtga caggtgagtg 720

aggggtcccg tgcctctagg acctctgccc atcctctgtc ctcctcagtg aggatccttg 780


ggttgttgat tgagtggagt tagggccttt taaaaaacta agactctaga agctaaacca 840

cgtgttgctt tacctgtctt ccaccctgag gatcacacgt taagtgttct taccagtcaa 900

aattgaatat gtatcaaaca aaaataaatg gccttccatg ctgaaataac aaaaaacaga 960

cacgcatgga gaacctactt tgtggggcgc ctgggtggcc cagtcggtta agtgtctgcc 1020

tcttcgtttt ggctcaggtc atgacctcgg ggttcatgag ttcgagcccc gcgtcagctc 1080

cgtgatgagc ctggagcccg cttggaattc cctccccacc cccacccccc gctcatgcca 1140

gctcgagctc tcgctcactc tctcaaaata aacttaagag gggcgcctgg gtggcgcagt 1200

cagttaagcg tccgacttca gccaggtcac gatcagcaca ttatttcctg gaccttccat 1260

tctcctttcg ctgtacagag cttaacgtaa actccctggc aagacctcct ttctgatttt 1320

agaaaggcca gcttattggt ttggttcctg taatagctta aaaatagaat ccagctgtat 1380

caggaaacat ttaaaaaatg tatcaaggaa gacctataac agtaaaaata tttttaaatc 1440

ccagagtgtt ttcataaaga cacaggatta cattactcaa ttatttttaa agggtttttg 1500

aaaagccgtg tttcacttgc catggctaat gattataggc atccgaatga gcctgtggct 1560

atgacttcag tctgttcggt ggaaatgact ctgatgtcat aaactgactc ggcttcgctg 1620

acaggaaagt cgtacagaag aaaagctgtt cgagcccata tgttggttgc gctcaatgtc 1680

aggaaggggc gacgtaatgt gtgcagaaat gggcagctgt cgagagtgaa gaaattggga 1740

agttggcacg gaagagggga ccgagtccga gaaggctgct ggataaagca gagcttttgc 1800

agaagagaag ggccggctgc tgtccctatc ctggtggcgg aaccacttag aaacaaggcg 1860

tcagaattag agacttcggt tcatgcaggg agggcggccc aggggggtgg cgtccttgga 1920

aactctggta agtttgagat tgatcccagg ggtcgtggga tggagcctcg catgagactc 1980

tacactgatc gatgagaagc agaagcccct tgtctgtgag gaaggggaca cgagcagttg 2040

gcacactaaa acgcaaggac acgtttctac gagaaaacgg tacatctgtc tgcgacacag 2100

aaagatcccc ggnaccagtc ntcgnnnnnn nnttccgntg ggattccagt cagcagttcc 2160

cgagaggcac tgaggaacac aggccctcac cacgttcaca agtgtcctga tgagagggat 2220

actaggtaaa cgaggttcga caggtgtggt ggttaatttt atacatcaac ctggctaggg 2280

tacggtgccc agttgtttgg ccaaacacca gtctagatgg ggctgtgaag gttaacattt 2340

aaaccaacag ggtgagtaaa gcagatcgct ttccattgtg tgggtgggcc tcatccaatc 2400

agttgaagac cttaaaagaa aagattgagg tccccccaaa aaggaagaaa ttctgccttc 2460

gaactcaaca ctgcagcttt gaccactgag agcatttcca gcctgccctg caaacgccag 2520

actcaccagc cccacaatca tgtgaaccaa ttccttaaaa taaacttctc tttctctctc 2580

tctatccaac tggttctgtt tctctgcaga accctgactc acgcagcagg tttccctgct 2640

acaggacttc atcagccttt caaccctaat atgctcatcc agggaggaat ggtttgtggt 2700

ttctccaagt tgtaaccgcc cctccccccc cgcccccgcc cccccaaagg cctgttaaca 2760

cagctgagtg tatggtacag ggcccacagt gaggtcatgg tggtagggga cgggacagat 2820

gccctcagag tttcctttct acccttcccc ccacccccga cgccaagagg gtctcggcaa 2880

ggccttgctc ctctgagctc tcagctgggc tttctctaca ggcccgacag cttcaacggt 2940

cacggctacc acctcttcca ggccatgcgg tttggcatcg aggagataaa caactccacg 3000

gccctcctgc cgaacgtcac cctgggatac cagctgtacg acgtgtgctc ggagtctgcc 3060

aacgtgtatg ccacactaaa cgtgctctcc ctgctgggga cacatcacgt agagatccga 3120

gcagaccctt cccactattc gcctgccgcc ctggctgtca ttgggcctga caccaccaac 3180

cacgcagcca ccactgcagc cctgctgagc cccttcctgg tgcccctggt gagctggagc 3240

ccgggggcct gtccatctcc cctgccggca ggtccagtgt
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acgggagtcc agtccaggct ccgagctgcc aataaagcgg 9000

tgaaacatgc gtcctggctg ctctagctgt ctgaaccgag ggtggggcg 9049


<210> 60 
<211> 2526 
<212> DNA 
<213> Felis catus 












<400> 60

atgtcactcc cggcggctca cctggtcggc ctgcagctct ccctctcctg ctgctgggct 60

ctcagctgcc acagcacaga gacgtctgcc gacttcagcc tccctgggga ttacctcctc 120

gcaggtctgt tccctctgca ctctgactgt ccgggcgtga ggcaccggcc cacggtgacc 180

ctctgtgaca ggcccgacag cttcaacggt cacggctacc acctcttcca ggccatgcgg 240

tttggcatcg aggagataaa caactccacg gccctcctgc cgaacgtcac cctgggatac 300

cagctgtacg acgtgtgctc ggagtctgcc aacgtgtatg ccacactaaa cgtgctctcc 360

ctgctgggga cacatcacgt agagatccga gcagaccctt cccactattc gcctgccgcc 420

ctggctgtca ttgggcctga caccaccaac cacgcagcca ccactgcagc cctgctgagc 480

cccttcctgg tgcccctgat cagctacgag gccagcagcg tgacgctcgg agtgaagcgg 540

cattacccct cgtttctgcg caccatcccc agcgacaagc accaggtgga ggccatggtg 600

ctgctgctgc agagcttcgg gtgggtctgg atctcggtgg tcggcagcga cggcgactac 660

gggcagctgg gggtgcaggc gctggaggag caggccaccc agcagggcat ctgcgttgcc 720

ttcaaggaca tcatcccctt ctctgcccgg ccgggcgacg agaggatgca gagcatcatg 780

caccacctgg cccgagcgag gaccaccgtt gtggtcgttt tctccagcag gcagctggcc 840

agggtgttct ttgagtcggt ggtgctggcc aacctgactg ccaaggtgtg gatcgcctca 900 

gaagactggg ccatctctag acacatcagc aatgtgcccg ggatccaggg cattggcacg 960 

gtgctgggtg tggccatcca gcagaggctt gtccctggcc tgaaggagtt tgaagaggcc 1020 

tatgtccagg cagataaggg ggcccctggg ccttgctcca ggacctccga gtgcagcagc 1080 

aaccagctct gtagagagtg tcgggctttc acggcagagc agatgcccac gctcggggca 1140 

ttctccatga gctctgctta taacgcctac cgggcagtct acgcagtggc ccatggcctc 1200 

caccagctcc tgggctgtgc ctctggagcc tgttccaggg accgagtcta cccctggcag 1260 

cttctggagc agatccgcaa ggtgaatttc ctcctacaca aggacaccgt gaggtttaat 1320 

gacaacgggg accctctcag tggctacgac ataattgcct gggactggag tggccccaag 1380 

tggaacttca gggtcattgg ctcctccatg tggcctccag ttcagctgga cataaataaa 1440 

accaaaatcc ggtggcacgg gaaggacaac caggtgccaa agtctgtgtg ctccagcgac 1500 

tgcctcgaag ggcaccagcg agtgatttcg ggtttctacc actgttgctt tgagtgtgtg 1560 

ccctgtgagg ccgggagctt cctcaacaag agcgacctcc acagctgcca gccttgtggg 1620 

aaagaaaagt gggcacccgc gggaagtgaa acctgctttc cacgcaccgt ggtgtttttg 1680 

acttggcacg agaccatctc ttgggtgctg ctggcagcta atacgttgct gctgctgctg 1740

gtgactggga ctgctggcct gtttgcctgg cacttagaca cccctgtggt gaagtccgct 1800 

gggggccgac tgtgcttctt catgctaggc tccctggcag ggggcagctg tgggctctac 1860 

ggcttttttg gggagcccac gctgcccaca tgcttgttgc gccaaagcct ccttgccctg 1920 

ggttttgcca tcttcctgtc ctgcctgacc atccgctcct tccaactggt cttcatcttc 1980 

aagttttctg ccaaggtacc caccttctac cgtgcctggg tccaaaacca cggtcctggc 2040

ctatttgtgg tgatcagctc aatggcccag ctgctcatct gtctaacttg gctggcggtg 2100 

tggaccccac tgcccaccag ggagtaccag cgcttccctc agctggtggt gcttgattgc 2160 

acagaggcca actcaccggg cttcatgttg gctttcgcct acaatggcct cctgtccgtc 2220 

agcgcctttg cctgcagcta cctgggcaag gacctgccag agaactacaa cgaggccaaa 2280 

tgtgtcactt ttagtctgct gctcaacttc gtgtcctgga ttgccttctt caccacggcc 2340 

agcgtctacc agggcaagta cttgcccgcg gtcaacgtgc tggcggcgct gagcagcctg 2400 

agtggcggct tcagcggtta tttcctcccc aagtgctacg tgatcctgtg ccgcccaaaa 2460

tttaacagca cacagcactt ccaggcctcc atccaggagt acacgaggcg ctgcggctcc 2520 

acctga 2526 

<210> 61 

<211> 841 

<212> PRT 

<213> Felis catus 

<400> 61 

Met Ser Leu Pro Ala Ala His Leu Val Gly Leu Gln Leu Ser Leu Ser
1 5 10 15

Cys Cys Trp Ala Leu Ser Cys His Ser Thr Glu Thr Ser Ala Asp Phe
20 25 30

Ser Leu Pro Gly Asp Tyr Leu Leu Ala Gly Leu Phe Pro Leu His Ser
35 40 45

Asp Cys Pro Gly Val Arg His Arg Pro Thr Val Thr Leu Cys Asp Arg
50 55 60

Pro Asp Ser Phe Asn Gly His Gly Tyr His Leu Phe Gln Ala Met Arg
65 70 75 80

Phe Gly Ile Glu Glu Ile Asn Asn Ser Thr Ala Leu Leu Pro Asn Val
85 90 95

Thr Leu Gly Tyr Gln Leu Tyr Asp Val Cys Ser Glu Ser Ala Asn Val
100 105 110

Tyr Ala Thr Leu Asn Val Leu Ser Leu Leu Gly Thr His His Val Glu
115 120 125

Ile Arg Ala Asp Pro Ser His Tyr Ser Pro Ala Ala Leu Ala Val Ile
130 135 140

Gly Pro Asp Thr Thr Asn His Ala Ala Thr Thr Ala Ala Leu Leu Ser
145 150 155 160

Pro Phe Leu Val Pro Leu Ile Ser Tyr Glu Ala Ser Ser Val Thr Leu
165 170 175

Gly Val Lys Arg His Tyr Pro Ser Phe Leu Arg Thr Ile Pro Ser Asp
180 185 190

Lys His Gln Val Glu Ala Met Val Leu Leu Leu Gln Ser Phe Gly Trp
195 200 205

Val Trp Ile Ser Val Val Gly Ser Asp Gly Asp Tyr Gly Gln Leu Gly
210 215 220

Val Gln Ala Leu Glu Glu Gln Ala Thr Gln Gln Gly Ile Cys Val Ala
225 230 235 240

Phe Lys Asp Ile Ile Pro Phe Ser Ala Arg Pro Gly Asp Glu Arg Met
245 250 255

Gln Ser Ile Met His His Leu Ala Arg Ala Arg Thr Thr Val Val Val
260 265 270

Val Phe Ser Ser Arg Gln Leu Ala Arg Val Phe Phe Glu Ser Val Val
275 280 285

Leu Ala Asn Leu Thr Ala Lys Val Trp Ile Ala Ser Glu Asp Trp Ala
290 295 300

Ile Ser Arg His Ile Ser Asn Val Pro Gly Ile Gln Gly Ile Gly Thr
305 310 315 320

Val Leu Gly Val Ala Ile Gln Gln Arg Leu Val Pro Gly Leu Lys Glu
325 330 335

Phe Glu Glu Ala Tyr Val Gln Ala Asp Lys Gly Ala Pro Gly Pro Cys
340 345 350

Ser Arg Thr Ser Glu Cys Ser Ser Asn Gln Leu Cys Arg Glu Cys Arg
355 360 365

Ala Phe Thr Ala Glu Gln Met Pro Thr Leu Gly Ala Phe Ser Met Ser
370 375 380

Ser Ala Tyr Asn Ala Tyr Arg Ala Val Tyr Ala Val Ala His Gly Leu
385 390 395 400

His Gln Leu Leu Gly Cys Ala Ser Gly Ala Cys Ser Arg Asp Arg Val
405 410 415

Tyr Pro Trp Gln Leu Leu Glu Gln Ile Arg Lys Val Asn Phe Leu Leu
420 425 430

His Lys Asp Thr Val Arg Phe Asn Asp Asn Gly Asp Pro Leu Ser Gly
435 440 445

Tyr Asp Ile Ile Ala Trp Asp Trp Ser Gly Pro Lys Trp Asn Phe Arg
450 455 460

Val Ile Gly Ser Ser Met Trp Pro Pro Val Gln Leu Asp Ile Asn Lys
465 470 475 480

Thr Lys Ile Arg Trp His Gly Lys Asp Asn Gln Val Pro Lys Ser Val
485 490 495

Cys Ser Ser Asp Cys Leu Glu Gly His Gln Arg Val Ile Ser Gly Phe
500 505 510

Tyr His Cys Cys Phe Glu Cys Val Pro Cys Glu Ala Gly Ser Phe Leu
515 520 525

Asn Lys Ser Asp Leu His Ser Cys Gln Pro Cys Gly Lys Glu Lys Trp
530 535 540

Ala Pro Ala Gly Ser Glu Thr Cys Phe Pro Arg Thr Val Val Phe Leu
545 550 555 560

Thr Trp His Glu Thr Ile Ser Trp Val Leu Leu Ala Ala Asn Thr Leu
565 570 575

Leu Leu Leu Leu Val Thr Gly Thr Ala Gly Leu Phe Ala Trp His Leu
580 585 590

Asp Thr Pro Val Val Lys Ser Ala Gly Gly Arg Leu Cys Phe Phe Met
595 600 605

Leu Gly Ser Leu Ala Gly Gly Ser Cys Gly Leu Tyr Gly Phe Phe Gly
610 615 620

Glu Pro Thr Leu Pro Thr Cys Leu Leu Arg Gln Ser Leu Leu Ala Leu
625 630 635 640

Gly Phe Ala Ile Phe Leu Ser Cys Leu Thr Ile Arg Ser Phe Gln Leu
645 650 655

Val Phe Ile Phe Lys Phe Ser Ala Lys Val Pro Thr Phe Tyr Arg Ala
660 665 670

Trp Val Gln Asn His Gly Pro Gly Leu Phe Val Val Ile Ser Ser Met
675 680 685

Ala Gln Leu Leu Ile Cys Leu Thr Trp Leu Ala Val Trp Thr Pro Leu
690 695 700

Pro Thr Arg Glu Tyr Gln Arg Phe Pro Gln Leu Val Val Leu Asp Cys
705 710 715 720

Thr Glu Ala Asn Ser Pro Gly Phe Met Leu Ala Phe Ala Tyr Asn Gly
725 730 735

Leu Leu Ser Val Ser Ala Phe Ala Cys Ser Tyr Leu Gly Lys Asp Leu
740 745 750

Pro Glu Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Leu Leu
755 760 765

Asn Phe Val Ser Trp Ile Ala Phe Phe Thr Thr Ala Ser Val Tyr Gln
770 775 780

Gly Lys Tyr Leu Pro Ala Val Asn Val Leu Ala Ala Leu Ser Ser Leu
785 790 795 800

Ser Gly Gly Phe Ser Gly Tyr Phe Leu Pro Lys Cys Tyr Val Ile Leu
805 810 815

Cys Arg Pro Lys Phe Asn Ser Thr Gln His Phe Gln Ala Ser Ile Gln
820 825 830

Glu Tyr Thr Arg Arg Cys Gly Ser Thr
835 840
<210> 62
<211> 10607
<212> DNA
<213> Felis catus

<220>
<221> misc_feature
<222> (1604)..(1683)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2470)..(2516)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2537)..(2537)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2560)..(2560)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2574)..(2574)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2580)..(2580)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2599)..(2599)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (2850)..(2850)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (5784)..(5830)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (7512)..(7553)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (8626)..(8626)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10453)..(10453)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10491)..(10491)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10501)..(10501)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10511)..(10511)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10545)..(10545)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10558)..(10558)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10574)..(10574)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10599)..(10599)
<223> n is a, c, g, or t

<400> 62

ttagctgctg aaacgctgct ttttagcaaa aggccgtgac ctcatgatgt tatacgtcgt 60

ggagattgag aaccaggtcc tagcatctga ctatgtgctt tgagtcccca cttttgctgg 120

ttgtgcaacc cagggtgagc ttcgtaagct tctctgtgcc tcagttttct catctgtgga 180

atggggccgg tcatagtccc cgttattgtg atcatcgagc aagatggtga atggcgagca 240

cacagcatga tgcctagttc ttactggaac acctgtcctg ggtcaggggc tgtatataaa 300

gtactacctg ccaggatcaa cttgatccgg ttctattctg tctcctgggt gagtatctgt 360

gccctttact cccagatgtt ggaaatgtca ggggcatgag acctgtcctt aaccgagtgg 420

cagaaggtta agtttgtgtc cgagatagca ggacatgctt tctctacctc cgcagggcgt 480

tctcccagac cccccagggc ccaccatgcc ctgctaggaa gggatcatcc taattctagc 540

ctcttcttcc gccccagagt tctgaagctt ctccacctgt ccaggtgttt ccccacccct 600

tcagccacgg caagaccgtc actatgtaaa tgtctgtgca aatcccctgg tgtcaagctg 660

ccagctctct gatgaggcag ggccacctcc ggggacccct cacttcccag ccatgggacc 720

ccgggccagg gaagtctgct gcttcatcat cctgccgcgg ctcctggctg agccggctga 780

gaactcagac ttctacttgg ctggggatta cttcctcggc ggcctcttca ccctccatgc 840

caacgtgaag ggcatcgtcc acctcaacct cctgcaggtg ccccagtgca aggagtgagt 900

cgccaatgtg gggctggaag tggcgacggg ggcggagtgg gaagcctggg ctggtcctgt 960

gctcctcagg ggaccacgcc aggaccaagg gctcaaaatg ctcttcctca ttcattgcca 1020

acctctcatc ccgcattatc cccaccggcc tgcagggaga ccccatgcag ttcatgttac 1080

caaaatcttt ggcaattgta ttctgaaata tggagagctg gttgtcccgc cgtgtgtctt 1140

aataaataaa gagttacagg gtacttgagc ctggaggggt tgtagagacc acccccacct 1200

actttgtcaa gtggggaact cctactgagt ccgtgtcaag tccaagtcta gacaccgggg 1260

gttatgcctt tggaaggcag aaatgtggtt tttcggtagc aggttctcag actggagggg 1320

aaggtttgca tttctctagg gctgtggtta ggtgggaagg ggtgcttcca ggaccagaag 1380

ggatttcctc cactcacctt gtcccctgtg agccctgggg gtggctgcat cactcaaggt 1440

tgggtgagac acctttgtgc aagtgcgaag gctgggatgg cggacccagc gtgggatgat 1500

gagatagtga cttgctgcag agagggtgaa ggcgtcctgt gagagaggga gagaaaaaag 1560

tctgtgacgt cggggaagat cacatgctgg cttgagaatg acgnnnnnnn nnnnnnnnnn 1620

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1680

nnngatgtgg aggtgatrgt gatggcggtg attgtgacgg tggtatcggt gatggtggtc 1740

acagacaacg cagttatagt gatggcagtg gtgataggaa tagtaggtgg tgatggtcat 1800

tctggagatg tggcaggtga caacgatgag atgaaaatgc cagaatcttc tggagtggct 1860

ccttcttgag ccactcctcg gctttcctat ggcaggcaga ggggactccc cggctctcct 1920

gtcccttccc cctctcactc tggacctgcc tctcacccca ccccacatgg ctcccccagg 1980

tatgaaataa aggtgttggg ctacgatctc atgcaggcca tgtgctttgc aggggaggag 2040

atcaatagcc agagcagcct gctgcctggc gtgctgctgg gctacaaaat ggtggatgtc 2100

agctacatct ccaacaatgt ccagcccgtg ctccacttcc cggcaaagga ggactgttcc 2160

ttgcccatcc aggaggacta cagccactgt gtgccccgtg tggtggctgt cattggtcct 2220

ggcaactctg agtccactgt gactgtggcc cgcttcctct ctctcttcct ccttccacag 2280

gggaggcccc tgggtcctgg ggtaaggagc tggggggcag aggagtggtt atccaggggg 2340

ctcacttccc cccaccggtc ctgggggtag gaggaggcag gaagtagggt cagaatgtca 2400


accccaatcc trggaaggca gcccagccac gtggttaaga gctcaggctt ggaggcagac 2460

agacckgggn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngcct 2520

tcagagagat catcctntca agggggccct tattcctttn cccctgggag cccntcagtn 2580

cccaccactt tctgcagcnc ccattcgggt ctccgattcc tccaatccac tcactcgctg 2640

tgtggctctg gataagtgac tgtccctctc tgaacctcag cgtcctcatc tgcaaagtgg 2700

agacataaca gcacatcaga aggtcgcgag aataggggcg cctgggaggc tcagtcggtt 2760

aagcatccga ttctgggtcg cggctcaggt catgatctcc cggttcgtga gttcaagccc 2820

cgcatcgggc tgtgtgctga cagcacagan cctgcttggg attctgtctt cccttctctc 2880

tgcccctcac ctgcttttgc tctctctctc tcaaaataaa taaataaact ttttaaaaaa 2940

aaggaaggta gtgagaaaaa agcgggtgac agagatggag agggctccac gcggtacctg 3000

gcatgctgcg agccctcaga acccgttagc gacggaagtg acctgtgtgc gtcgtcacca 3060

ccatcccagc aggccttgag gcttcgaccc tgcctccccc gcaaagctca cagtctccga 3120

ggctccgggc cacgtccccc gggcgtcctg tctgtgtccc tcgaaccccg cccagccctg 3180

ccgcaccgtg agctagtcag cgcctgctgg gttcgtgact ctctccgcca ttgtgcaccc 3240

tggggctggg gccacaccca ggggctccgg ttaatttaga tgctttcttt ctctgccatc 3300

tgcttacccc cgagcttggt tagagagcct gactttgctg ggagtctcca gaacgtcccg 3360

ggacctccca gcaaccagca tctttattct ccctccttag aactgatgtg tgcagtcgct 3420

gtgcctctgc agctcagagc aggggtggtt cctgtgaact ggggccaggg gtggtttcct 3480

ggagggggca aggcaccgac tagccctcga agaaggagcc gggcttggct gaggtgggac 3540

agggggagag catgaggttt tcggccagct ttctgtgcct gggaaccccc tctccccaca 3600

accctggatc ccagaggcct taacgggccc cagctgtaac agactcgtct gtgtcgagca 3660

ttccacagta ggtgtcccca ggctccctcg gggccaccaa aggaccacaa cgacattacg 3720

cggacagggt ctcagattcc gatgggtccc ctgtttgctg gaaccatctc cctttggaaa 3780

tttacagctc tcttttctgg cagtaacccc gccccttggt gctgggtacg aagggggcac 3840

ccagagcggg gctcacccag cagcgctgac tgctgcgttg tcgggctaac gggtattaac 3900

cgcctccctc gccgctccca ttctcttagc tgctgaaacg ctgcttttta gcaaaggccg 3960

tgacctcatg atgttatacg tcgtggagat tgagaaccag gtcctagcat ctgactatgt 4020

gctttgagtc cccacttttg ctggttgtgc aacccagggt gagcttcgta agcttctctg 4080

tgcctcagtt ttctcatctg tggaatgtgt gagggggaga cctcagtttc aagcggggtg 4140

gccaggaggg cctttctgac aactggacaa cgacctgagg gagaggaagg agtgagggag 4200

ctatgtgggt gcctagaaga gcgctccgga agagggggca gcgaatgcag aggccggcag 4260

gagcctggtg cgttggctga accggtgagc agccccggga ccaggcggga cagtaggaga 4320

agatgaagcc agagaggtga gggccggggt cagtggtgga gccccttggg ggccactgaa 4380

ggactctggc tgtcctcgag tgacattagg agctgttggg gagttttgag ctgaggagta 4440

aggtgacgga caagtggtcg cagaggccac ccggctgcca cgaacagcag cagagacagc 4500

caaggggaag ggtggggggc tgtggtgacc ccgggagggt ggtgatggtg gcccggtgag 4560

gccctagctc acgctggcgg ccctccgctc tccggcagat cacctacagc gccatcagtg 4620

acgagctacg ggacaagcag cgcttcccgg cccttctgcc cacagcgccg ggcgccgatc 4680

accagatcga ggccatggtg cagctgatgt tgtacttccg ccggaactgg atcatcgcgc 4740

tggtgagcag cggcgactgc ggccgcgacg acagccagct gctcagcgat cgcccggccg 4800


gcggcgacac ctgcatcgcc ttccgggaga cgctgcccat gccccagccc aaccaggcgg 4860

tgacgcagtg ggagcgccgg cgcctgaagg ccatcgtgga cgagcagcag cggcagagct 4920

ctgcgcgcgt cgtggtcctg ctgtcgccaa agctggtcct gcacaacttc ttccgcgagg 4980

tgctccgcca gaacctcacg ggcgtcgtgc ggatcgcctc cgagtcctgg gccatcgacc 5040

cggtcctgca cgacaggccc acgcgctgca cagcctcctg ggctgcaccc agaccagcag 5100

ctccgggtcg tctatccctg gcaggtgagg ccccacccac ggagagtcgg ggccacacac 5160

gcaggcgccg ccacagccct gagtggttgc catggagacc actgccctgc tctagcgtcc 5220

ccctctctgg ccgggtcctg ggcaaactgg cgggagaggc caggggacgt accctgtccc 5280

cagacacata aagccagaag tgcttcatgg tgacaaaact ccttttttta cattaatgta 5340

atcctcgcca tccaagatag cctgtcccgg caggagattt gggtgaagtt tcctggaagg 5400

aggcctggca ggcagtgggc cccctgggcc ccctgccgtt tctccagggt ggcggccttg 5460

ggggaggact tctgtgttca gctctctgag gctctgcttt gggtttatgc atcttctctc 5520

gtcccaggtc tggacgattc agaggagtaa ggaggcaagg agtcgcctgg attcagacct 5580

ggaatttaaa tctgtatttt tctgatctgc gtgcacaccc gcgcgtgcac acacacacac 5640

ctaaccacga agtttatgta ggtagaagat tttactgagg gggcgcctgg gtggctcagt 5700

cggttaagcg tccgacttca gccaggtcac gatctcgcgg tctgtgagtt cgagccccgc 5760

gtcaggctct gggctgatgg ctcnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5820

nnnnnnnnnn agcaccccga gggcccgggg gagggcacct gagcccgtaa agggaaacag 5880

gagtggcctc tgaacccagg tgataggtct ccgctggatg gcagacgtga ctcccacggg 5940

agcaggaata atgtcgacac atcggccgga aggggagcac ttcctggtgt gcagtcattg 6000

tgctaagctc ccaacattgg gaaactcatg cgttgcttca gagcccggga gacagggttt 6060

ttgttgtcct actttacaga agaggagact ggagctcacg ggggttgggc gacaggcccg 6120

aggctcagag caggtggcag agctggtgcc tgaacccagg tgtgtctgac tacagagccg 6180

gggctcccag ccgctgcctc ccgggtgacc acatctgcgg tctcattgcc cccttgtagg 6240

gatgtggaca cccagtctcg tggggtagtc actctccccc ggatcgagcc cgacttcttt 6300

tttttttttt aatttttttt tcaacgttta tttatttttg ggacagagag agacagagca 6360

tgaatgggcg aggggcagag agagagggag acacagaatc ggaaacaggc tccaggctcc 6420

gagccatcag cccagagcct gatgcggggc tcgaactcac ggaccgcgag atcgtgacct 6480

ggctgaagtc ggacacttac ccgaatgcgc cacccagggg cccagatcga gcccgacttc 6540

tgacgccagc gtcgcttcct ttccctgtgg cctcccagct gcttcaggaa atctggaagg 6600

tcaacttcac cctcctgggc caccagatct tttttgacca gcgaggggac ctactcatgc 6660

gcctggagat catccaggga cggtgggacc tgagccagaa cctttctgga gcgtcgcctc 6720

ctactgcccg gtgctacgac ggctgagggc catccgtgac gtctcctggc acacggccaa 6780

caacacggtc agctctcgga gggctggtgg ggggctggga cctgggtctg ggcactggct 6840

cgtgcagggg tggcaagggc cctgtggacc tgagatccat tatcgagcac tgatgtcatc 6900

cctatttgtg ggtgtccctc ctcccattga ctaagcactg tggaagtcta gagctttctg 6960

gatcctcagg acccaggggc tcagggggct gcacaaagtg aacgttaggt ggacacgtgt 7020

gtgctaagga cttcaattct catgtcaacc ctaggaaata gagagtactg ttcctcctgt 7080

ctttggggtt gggaaactgg aggcacagag ggggtcgcgt gacccataaa aggccacaca 7140

gctttcgcat gtctctatac acagcattca gtctacatcc catcgattag tactcgcgtt 7200


ttggggacag tagctgtgcc ttcacctgtg tctgacatct gtcagtctga aagctccttt 7260

gttttaccct cttagcttac aagctgtcag aatggccgcg atgtggggaa ggtagagact 7320

cagcctcgtg gggaaggggg gaggtggggg gacctaaaag ttcaaagagc cagggcacct 7380

gggtggctca gtcagttaag catccgactc tggatctcag ctcagtcttg atctcaggtc 7440

gtgagtttag acccctgtgt agggctccgt gctgggcgcg cagcctactt aaaaataata 7500

aaaacaaaag cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnngatcccc 7560

gtgtccatgt gttccaagga ctgccagcct gggcaaagga agaagcccgt gggtattcat 7620

ccctgctgct tcgagtgtct cgactgcctt ccgggcacct tcctcaacca aactgcagat 7680

gggactcaca gacccacacc cctgccctgc cctgccctgc cccgccctgg ggctcccagg 7740

gcccttcatc tttggcaggg tctctggagt ctcatccagg ggacacaggt gtccaaaggc 7800

cagggaccat gttttgactc cgcttgtatc tccctaaccg ctggtgtaag aaaaatcttc 7860

aatgctgtga gggcgtgggg gtgggagaag gaacagccct caaccaggcg aggctgtaac 7920

tgatcccctc tgcacacaca tgtagctgag ggcccagggg ggtcaggcca gagaatgtcc 7980

accggatgaa cgaacgaatg aatgaatgaa cgaacgaaca aacacacaaa tgaatgaatg 8040

tctctgtccg tagaagaaat gtttctggca gacagggcta ggatctaatt tctctctgtg 8100

gcctcccgag tgcctcgtgt agttcggagc atataatgtt tgctcagtga atgtttattg 8160

agtgacatcc ttgatgagaa gaattgacat ctccccctat agatcataaa ctccaggaaa 8220

ggggggacaa tgtcatccct ccagtgttta ccacagttca ccgttggggc cgaattattt 8280

ttttttcatg acttcacaga ttagtaacta agcggttctg tacatctacc gatcagagta 8340

cttacgacgt gcccagcaga gcccagggca cagggtaggt gctcaacaaa agtttgtttg 8400

caattgatca gtagccggaa gtcagggggc tcggttttat ccacgtctgt gctctccatc 8460

tcagatgcct atcacagtgg gtggcgctca aaaagaaact tgaataaacg gtcgaatgtc 8520

catctcacca gagggtacgg tcttggaagg gaggcattac ggttgccagg ctctgagtca 8580

aggggacctt ggaccacatc ctgcctctgt aactggtttt gtaacngcct ggaggagcct 8640

cagatgccac atctgtgaaa tggggttgca gtgaggatct gatgggccgg tggatacgag 8700

ggacgcagtg agaggtgcta cgaccgcagg catcgccctt ggctcgcccc ctccctaccc 8760

ctacagccgg ccgggtgcag gtgcagagga tgtgggtgcc gggaaggtgg gtgtatctga 8820

tggaactgct gtgggctctt gcagacgagt ttggctgccg gccctgcccg agttgcgggt 8880

ggtcccggag gaacgacgct tcgtgcttca agcggcggct ggcctccctt gaatgacgcg 8940

aggcacccgc cgtcgctgtg gccgtgctgt ccatcctggg ctccctctgc accctggcca 9000

tcctggtgat cttctggagg caccgccacg cgcccatggt tcgctcggcc gggggcccca 9060

ggtgcttccc gatgccgatg cccctgctgt ataggtgacg gtctccatgt acatcgggca 9120

gcccgcgttt ttcatgtgcc tcggccacca gaccctcttc accctctgct tcaccgtctg 9180

tatctcccgt gtcaccgtgc gctctttcca gatcgtccgc gtcttcaaca tggccaggcg 9240

cctcccgcgt gcctacggct actgggtccg ctaccacggg ccctgtgtct tcgtggcgtc 9300


cttcacggtg 


ctcaagatgg 


tcatcgtggc 


gggcaacgtg 


ctggccgcga 


ccgccgagcc 


9360 


cgccgcccgc 


cccgaccccg 


atgaccccaa 


gatcgcggtt 


ctcgcctgca 


actaccacaa 


9420 


cgtgctcctg 


ttcgacacca 


gcctggaccc 


gcttctgtcc 


gtggcgggct 


tcggcttcgc 


9480 


ctacgtgggc 


aaggagctgc 


ccaccaccca 


caacgaggcc 


aagttcttca 


ccttccgcat 


9540 
ctgtaccttc atgtctgtct 


acgagggggt 


9600 


ll l y y l^u^l. 


at" rrt" rrr arr 

□ ul w l y l a. l l 


t cert cicr tcrcrc 

x- l y c y cj uyy u 


agtgctcaac 


cttctgggcg 


r*t" t" t" nncr rr 

L L L» l y y V — LLL 


9660 




l y y l l l ^Qci y 


tgctgcgtgg 


tcctcttcta 


cccggatcac 


aara p rrr p r rr 

d Cl L O. Lp y L- LLy 


9720 


L. L L CI L. L. u^uy 


cagcatgatt 


cagggctaca 


ccaccgggaa 


ggactagcac 


ffrrrrrrt/rffT 

l y ^ — l l l l l y y 


9780 


r't'irrrr^rfrrrr 


ctcic r p\ cm cicict 

yyu^ayayy y 


r t" r ctci Y pi c\~ ci 


ggagatggag 


accaggggtg 




9840 




l l ci l l v ci y v — l 


l l i- y v- l y y y a 


gcagggacac 


caccccgccc 


L CL LLL LL LUG 


9900 


l l uyy lll 


r r r +■ r~- r- pi n <~r t~ 
lll LL^ayy l 


1~r1~r1~f"Tr;}rr 

LLLLLy LCtLL 


ctggccgttt 


ttacccaccc 


y LLyy LyyaL 


9960 


y L L- CI CI CI CI CI L, 




LyLayLLy l_ u 


tggcttgcca 


ggcactgcca 


LLLCl LyL Lciy 


10020 


rfrr^.s^a.rrrr.arT'r 
yyd.ciciyyci.yL 


rrTrrrrrTt~CTArr 
Ly y y y l y dL l 


LLLLLa Ly y y 


tctccaagac 


agagatggag 


LydayLay LL 




cacagtcgcc 


3+- r"r~rrrrt~rrfTt" 

a t - L l y y l y y L- 


cacagcgggt 


gtccgcaggt 


tccggctccg 


rrrrr a rrrra t~ rr 

y y Lay l l a Ly 


10140 


> — i»y yoayyu l 


rTrrrTrt~rTrrrrrrr 
yy y l uyyyy^ 


i~ rr n t~ rr t~ "T nrrn 
Ly y Ly l Lyyy 


ggacatctgc 


ccggcatcat 


LLaLLLLL Ly 


10200 




L- y Ly LtuU 


L LLL-L-ayClL . L 


cccccgcccc 


ccagcttggg 


dLLLdyL l uy 


10260 


yydLLLdyLL 


lull Lyay ll 


aLyyL.LyL.yL 


ataggggctg 


cttcataaat 


yLL LdLyddL 




aaacctccct 


tgggtgaaac 


gaaggcgttt 


ccttcttgtt 


tccagaggtt 


tcccccctcc 


10380 


cccccccgtc 


gccccaagaa 


agaagactgg 


gatcagagac 


ctcagcttcc 


atttccgcgt 


10440 


tgccacttct 


ganccgtgta 


ctttgggcca 


attctattta 


ctgtttcgga 


ncctacacgg 


10500 


nccctttcct 


naaataggaa 


caataaacca 


ggggcacctt 


tgacncactg 


tgtagtancc 


10560 


aatttgacga 


taantttttt 


taaaagatta 


aattaatcng 


ataaatt 




10607 


<210> 63 

<211> 1176 

<212> DNA 

<213> Felis catus 












<400> 63 
atgggacccc 


gggccaggga 


arrt"rt~nr't"n'p 

c* y LULy\.uyu 


ttcatcatcc 


tgccgcggct 


rrtrrfrlTrflrr 
l l l y y l l y ciy 


60 


ccggctgaga 


actcagactt 


art frrrrrt" 


ggggattact 


tcctcggcgg 


rri - p 1~ i~ r a r r 

L L LL L LLaLL 


120 


ctccatgcca 


acgtgaaggg 


L d L L y Lt.^uL 


ctcaacctcc 


tgcaggtgcc 


LLciy L.yLctciy 


180 


gagtatgaaa 


taaaggtgtt 


cr rr n r "T a r rr?3 +* 
yyyv^ Lu\.y u — 


ctcatgcagg 


ccatgtgctt 


LyLoyyyydy 


240 


gagatcaata 


gccagagcag 


L L LyLLyLL L 


ggcgtgctgc 


tgggctacaa 


ad Lyy LyyctL 




gtcagctaca 


tctccaacaa 


y uwuuy ^ll 


gtgctccact 


tcccggcaaa 


ctcfP\ rr rr a r t~ rr*r 
yyo.yyci.LLy l 


360 


tccttgccca 


tccaggagga 


L LOLCty LLCIL 


tgtgtgcccc 


gtgtggtggc 


Ly LLdLLyy l 


420 


cctggcaact 


ctgagtccac 


l y Ly uu uy Ly 


gcccgcttcc 


tctctctctt 


rr1~rr't"i"rrA 

LLLLL L LLLa 


480 


cagatcacct 


acagcgccat 


l a. y Ly aLy ay 


ctacgggaca 


agcagcgctt 


LLLyyLLL L L 


540 


ctgcccacag 


cgccgggcgc 


cgatcaccag 


atcgaggcca 


tggtgcagct 


gatgttgtac 


600 


ttccgccgga 


actggatcat 


cgcgctggtg 


agcagcggcg 


actgcggccg 


cgacgacagc 


660 


cagctgctca 


gcgatcgccc 


ggccggcggc 


gacacctgca 


tcgccttccg 


ggagacgctg 


720 


cccatgcccc 


agcccaacca 


ggcggtgacg 


cagtgggagc 


gccggcgcct 


gaaggccatc 


780 


gtggacgagc 


agcagcggca 


gagctctgcg 


cgcgtcgtgg tcctgctgtc 
gccaaagctg 


840 
gtcctgcaca 


acttcttccg 


cgaggtgctc 


cgccagaacc 


tcacgggcgt 


cgtgcggatc 


900 


gcctccgagt 


cctgggccat 


cgacccggtc 


ctgcacgaca 


ggcccacgcg 


ctgcacagcc 


960 


tcctgggctg 


cacccagacc 


agcagctccg 


ggtcgtctat 


ccctggcagg 


tgaggcccca 


1020 


cccacggaga 


gtcggggcca 


cacacgcagg 


cgccgccaca 


gccctgagtg 


gttgccatgg 


1080 


agaccactgc 


cctgctctag 


cgtccccctc 


tctggccggg 


tcctgggcaa 


actggcggga 


1140 


gaggccaggg 


gacgtaccct 


gtccccagac 


acataa 






1176 



<210> 64 

<211> 391 

<212> PRT 

<213> Felis catus 

<400> 64 

Met Gly Pro Arg Ala Arg Glu Val Cys Cys Phe Ile Ile Leu Pro Arg 
1 5 10 15 

Leu Leu Ala Glu Pro Ala Glu Asn Ser Asp Phe Tyr Leu Ala Gly Asp 
20 25 30 

Tyr Phe Leu Gly Gly Leu Phe Thr Leu His Ala Asn Val Lys Gly Ile 
35 40 45 

Val His Leu Asn Leu Leu Gln Val Pro Gln Cys Lys Glu Tyr Glu Ile 
50 55 60 

Lys Val Leu Gly Tyr Asp Leu Met Gln Ala Met Cys Phe Ala Gly Glu 
65 70 75 80 

Glu Ile Asn Ser Gln Ser Ser Leu Leu Pro Gly Val Leu Leu Gly Tyr 
85 90 95 

Lys Met Val Asp Val Ser Tyr Ile Ser Asn Asn Val Gln Pro Val Leu 
100 105 110 

His Phe Pro Ala Lys Glu Asp Cys Ser Leu Pro Ile Gln Glu Asp Tyr 
115 120 125 

Ser His Cys Val Pro Arg Val Val Ala Val Ile Gly Pro Gly Asn Ser 
130 135 140 

Glu Ser Thr Val Thr Val Ala Arg Phe Leu Ser Leu Phe Leu Leu Pro 
145 150 155 160 

Gln Ile Thr Tyr Ser Ala Ile Ser Asp Glu Leu Arg Asp Lys Gln Arg 
165 170 175 

Phe Pro Ala Leu Leu Pro Thr Ala Pro Gly Ala Asp His Gln Ile Glu 
180 185 190 

Ala Met Val Gln Leu Met Leu Tyr Phe Arg Arg Asn Trp Ile Ile Ala 
195 200 205 

Leu Val Ser Ser Gly Asp Cys Gly Arg Asp Asp Ser Gln Leu Leu Ser 
210 215 220 

Asp Arg Pro Ala Gly Gly Asp Thr Cys Ile Ala Phe Arg Glu Thr Leu 
225 230 235 240 

Pro Met Pro Gln Pro Asn Gln Ala Val Thr Gln Trp Glu Arg Arg Arg 
245 250 255 

Leu Lys Ala Ile Val Asp Glu Gln Gln Arg Gln Ser Ser Ala Arg Val 
260 265 270 

Val Val Leu Leu Ser Pro Lys Leu Val Leu His Asn Phe Phe Arg Glu 
275 280 285 

Val Leu Arg Gln Asn Leu Thr Gly Val Val Arg Ile Ala Ser Glu Ser 
290 295 300 

Trp Ala Ile Asp Pro Val Leu His Asp Arg Pro Thr Arg Cys Thr Ala 
305 310 315 320 

Ser Trp Ala Ala Pro Arg Pro Ala Ala Pro Gly Arg Leu Ser Leu Ala 
325 330 335 

Gly Glu Ala Pro Pro Thr Glu Ser Arg Gly His Thr Arg Arg Arg Arg 
340 345 350 

His Ser Pro Glu Trp Leu Pro Trp Arg Pro Leu Pro Cys Ser Ser Val 
355 360 365 

Pro Leu Ser Gly Arg Val Leu Gly Lys Leu Ala Gly Glu Ala Arg Gly 
370 375 380 

Arg Thr Leu Ser Pro Asp Thr 
385 390 
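
The relationship between the 1176-base coding sequence of SEQ ID NO: 63 and the 391-residue polypeptide of SEQ ID NO: 64 can be spot-checked by translation. The following is a minimal sketch, assuming the standard genetic code; the function name `translate` and the variable names are illustrative and not part of the listing. Only the first two legible blocks and the final block of SEQ ID NO: 63 are used, because several intermediate blocks are illegible in this copy.

```python
# Standard genetic code in the conventional T, C, A, G ordering (an assumption:
# the listing does not state which translation table was used).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop codon."""
    dna = "".join(dna.lower().split())
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[pos:pos + 3], "X")  # 'X' for codons with ambiguous bases
        for pos in range(0, usable, 3)
    )


# First two blocks and last block of SEQ ID NO: 63, copied from the listing.
first_blocks = "atgggacccc gggccaggga"
last_block = "acataa"

print(translate(first_blocks))  # MGPRAR  (Met Gly Pro Arg Ala Arg)
print(translate(last_block))    # T*      (Thr followed by the stop codon)
print(1176 // 3 - 1)            # 391 residues once the stop codon is excluded
```

The output agrees with the first six residues and the final residue of SEQ ID NO: 64, and with its stated length of 391.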


<210> 65 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 65 
taaacaactc cacggccctg ctgc 24 


<210> 66 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 66 
cccagggtga tgttgggcag cagg 24 



<210> 67 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 67 
gctgtgtatg cggtggccca tggc 24 



<210> 68 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 68 
ccaggagctg gtggaggcca tggg 24 


<210> 69 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 69 
tgctgaccaa cctgactggc aagg 24 


<210> 70 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 70 
tctgaggcga cccacacctt gcca 24 


<210> 71 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 71 
ccagttcagc taaacataaa tgag 24 


<210> 72 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 72 
gccactggat tttggtctca ttta 24 


<210> 73 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 73 
agctaacacg ctgctgctgc tgct 24 



<210> 74 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 74 
agcagtccca agcagcagca gcag 24 


<210> 75 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 75 
tgtgtcacct tcagcctgct cttc 24 


<210> 76 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 76 
tccaggacac gaagttgaag agca 24 


<210> 77 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 77 
tacttcggcc ccaagtgcta catg 24 


<210> 78 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 78 
ccgggtagaa gaggatcatg tagc 24 



<210> 79 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 79 

tggtcaccat cgtggacctc ttgg 24 



<210> 80 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 80 

aggttgagca cagtgaccaa gagg 24 



<210> 81 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 81 
accaactaca acgaggccaa gttc 24 

<210> 82 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 82 
tcatgctgag ggtgatgaac ttgg 24 

<210> 83 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 83 
tccgagtcct gggccatcga cccg 24 



<210> 84 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 84 

tgaggttgtg caggaccggg tcga 24 



<210> 85 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 85 
tacaacctca tgcaggccat gcgc 24 



<210> 86 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 86 
tctcctccac cgcgaagcgc atgg 24 



<210> 87 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 87 
atcaccatcc agagcgtgcc catc 24 



<210> 88 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 88 
actcactgaa gcccgggatg ggca 24 



<210> 89 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 89 
accaccacgt cgaggccatg gtgc 24 
<210> 90 
<211> 24 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 90 
aagtgcagca tcagctgcac catg 24 



<210> 91 
<211> 23 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 91 
tcrgacttct acctgcctgg rga 23 
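
Several of the synthetic constructs (for example SEQ ID NOS: 91, 94, 96, and 97) contain IUPAC ambiguity codes such as r, w, k, and s, so each stands for a small pool of concrete oligonucleotides. The following is a minimal sketch of how such a sequence can be expanded, assuming the standard IUPAC nucleotide code assignments; the names `IUPAC` and `expand` are illustrative and do not come from the listing.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (lowercase, as used in the listing).
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "s": "cg", "w": "at", "k": "gt", "m": "ac",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg", "n": "acgt",
}


def expand(primer: str) -> list[str]:
    """Return every concrete sequence encoded by a degenerate primer."""
    seq = "".join(primer.lower().split())  # drop the block-separating spaces
    choices = [IUPAC[base] for base in seq]
    return ["".join(combo) for combo in product(*choices)]


# SEQ ID NO: 91 as given in the listing; it contains two 'r' (a or g) positions.
variants = expand("tcrgacttct acctgcctgg rga")
for v in variants:
    print(v)
print(len(variants))  # 4 concrete 23-mers
```

Two two-fold degenerate positions give 2 x 2 = 4 concrete sequences, each 23 bases long, consistent with the <211> entry for SEQ ID NO: 91.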



<210> 92 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 92 

cttcacgttg gcatggaggg 20 



<210> 93 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 93 

tacctcctgg gtggcctctt c 21 



<210> 94 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 94 

tcttgcacwk gggcacctgc 20 



<210> 95 
<211> 22 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 95 
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22 



<210> 96 
<211> 21 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 96 
gggcakgtag tggctgtagt c 21 


<210> 97 
<211> 22 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 97 
ggctacaacc tsatgcaggc ca 22 


<210> 98 
<211> 22 
<212> DNA 
<213> Artificial Sequence 
<220> 
<223> Synthetic Construct 
<400> 98 
gagttgtcag ggccaatgac cg 22 
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